Preface



This book is intended for both undergraduate students and their staff as a 
guide to individual projects, group projects and case studies in mathematics. 
It covers all aspects of setting up projects and their assessment. The bulk of 
the text is devoted to giving worked examples of the various kinds of project. 
The author has benefited from a dual career first in an engineering department 
where he spent eight years which included looking after individual and group 
projects, often of a very practical nature. For the last twenty years, he has 
been within mathematics but still supervising different kinds of projects. This 
book is therefore written largely from experience. Mathematics here excludes 
statistics and operational research. 

In the last few years there has been pressure on all undergraduate courses to 
include student centred learning. In mathematics this is difficult because most 
of the curriculum is devoted to learning mathematical skills. These skills are 
invariably assessed by examination as it is only in the controlled environment 
of an examination that the assessors can be sure who is being assessed. One 
does not assess skills such as spelling or piano grades by setting coursework or 
projects, hence many would ask how can you assess mathematics other than 
by a closed book examination? My reasoning is that with the increase in ac- 
cess the nature of many mathematics degrees has changed. If mathematics as 
a discipline is to keep pace with other undergraduate subjects it has to include 
modules that enable the student to develop in the broad sense. This means 
students must have the opportunity to work by themselves on a mathematical 
topic, or the opportunity to work with three or four other students towards a 
common goal. It is recognised that this must not be compulsory as there must 
be room for the clever student who has mathematical flair and may go on to a 
Fields Medal. This is the mathematics equivalent to the Nobel Prize. Incidentally,
there is a story that there is no Nobel Prize in mathematics only because
Alfred Nobel’s wife eloped with a mathematician! As Nobel never married, the
story cannot be true. However, perhaps he lost out in love to a mathematician.
It is also said that Nobel had an intense personal dislike of the Swedish
mathematician Gösta Mittag-Leffler, who was attractive to the ladies and would
have been the obvious candidate for a Nobel Prize in mathematics. Whatever
the truth, it is a sad omission, but as it happens many who have graduated in
mathematics have gained a Nobel Prize: Paul Dirac (Physics 1933), Bertrand
Russell (Literature 1950) and Richard Feynman (Physics 1965), to name but
three.

These days mathematics must not be seen as an elitist subject only to be 
taken by specialists, but as a useful subject which many can benefit from study- 
ing. The usefulness of mathematics should be obvious to scientists, engineers 
and economists particularly as even the softer areas of these subjects become 
more quantitative, but its service as a basic discipline that aids the underlying 
thought processes needs emphasising too. Mathematics graduates have gone 
on to be politicians and lawyers as well as to succeed in the creative arts. The 
cry of “what use is Pure Mathematics” should be ignored; who in their right 
mind questions the use of music, literature or fine art? Mathematics has always 
been the most transferable of skills, but mathematicians have not traditionally 
been exposed to project work and group work as undergraduates. Traditionally, 
the mathematics undergraduate has had no forum in which they can discuss 
approaches to a problem with fellow students, or to voice their own opinions. 
This is no longer acceptable, and this text will help both students and lecturers 
see how projects and case studies can work well in mathematics. 

In the text, there are passages that are verbatim extracts from student 
projects. These are indented and in a smaller font. In other places, the text may 
state that the following “was done in the project” or some such phrase. In these 
instances, the author has not copied the student’s work but has used the same 
mathematical method or proof so that the reader understands how the student 
has approached a particular piece of mathematics. In all cases the author would 
like to thank all the students who have contributed projects and group work to 
this text, particularly those whom he has supervised. Their anonymity has been
preserved although they will easily recognise their own projects one hopes. 
Three individual projects have been included in their entirety in three separate 
appendices. Heartfelt thanks go to the three students concerned for permitting 
this, especially those for whom the project experience was less than ideal. The 
prime purpose of this text is to learn from the examples of others and to 
encourage students to try projects and case studies. One can often learn much 
from reading the works of other students, so these examples are very valuable. 
For the record, these projects have been faithfully reproduced as closely as 
possible from the originals. This means that the figures were scanned in and 
therefore deliberately do not match the usual high standards of Springer-Verlag.
On the other hand, the use of LaTeX has beautified the look of the mathematics,
if not its content!

Thanks also go to the staff of the Mathematics and Statistics Department 
at the University of Plymouth. Many of the ideas, particularly those on
assessment, come from them. I thank those publishers who have permitted me to use
figures from their books. Finally a big thank you to Noel Ford who drew all 
the cartoons to my sometimes less than well formed ideas. 

Phil Dyke 
September 2003 
1

Introduction and Organisation 



It is only in the last twenty years or so that mathematics courses at degree 
level have been assessed by anything other than examinations. At school level 
in the UK, GCSE Mathematics has always contained continuous assessment 
as part of the portfolio since the introduction in the late 1960s of its prede- 
cessor the CSE. This was controversial at the time and remains so amongst a 
significant number of teachers and academics, especially those in higher educa- 
tion in mathematics. The main problem lies in the accuracy of the assessment. 
Cheating is easy and difficult to detect unless the assessments are carefully de- 
signed. Much of mathematics is a skill acquired by practice much like playing 
a musical instrument or participating in a sport. No-one would assess a con- 
cert performance using continuous assessment. Of course other more discursive 
subjects seem to have always had projects and case studies. Mathematics itself 
can be partitioned into various sub-categories: of these only statistics, oper- 
ational research and to some extent numerical methods have taken naturally 
to this kind of assessment. In fact, excluding only the most theoretical
probability, it can be argued that the whole raison d’être behind statistics is to
get meaning from data, to analyse, infer and predict, and data sets come from
a wide variety of sources such as biology, economics, sport, etc. Operational
research too is all about finding optimal routes, the best strategies, algorithms
for the most compact packing and so on and so requires input of data from 
subjects that are outside mathematics. Therefore projects and case studies are 
natural vehicles for assessing the ability of the student or group of students to 
apply statistical or operational research techniques. The rest of mathematics 
is rather different, so much so in fact that projects and case studies centred 
around statistics and operational research have been explicitly excluded from
this text. Applied mathematics has a different problem. If the modelling effort 
shifts to be focussed entirely on the problem, the discipline risks becoming la- 
belled as engineering, applied physics or the subject in whatever pigeon hole 
the problem lies. Given a real application, applied mathematics can cross the 
border. In fact there are national differences and most of what is termed phys- 
ical applied mathematics in the United Kingdom lies outside mathematics in 
the rest of Europe and the USA. In the USA, differential equations are applied 
mathematics, but mechanics, fluid mechanics and electromagnetic theory are 
parts of physics. Here we retain the broader UK definition of Applied Math- 
ematics. Projects in pure mathematics are still very much a rarity, but are 
becoming more commonplace. It is not impossible to base projects around the 
subjects of algebra, analysis, number theory, topology, geometry or differential 
equations, with the latter two being an extremely rich source. For example 
algebra with number theory is the basis of codes and cryptography. The first 
two appendices give examples of projects based on pure mathematics. 

An external reason for moving away from examinations towards other forms 
of assessment lies in the changing nature and pattern of work. Talking directly 
to students, it seems more and more likely you can now expect to change jobs 
much more frequently than your equivalents ten or twenty years ago. It is 
therefore the duty of University courses to educate you in such a way that you 
can cope with this. If all the assessment was in the form of examinations, the 
tendency would be (in mathematics) for you to train yourself to be able to do a 
range of problems and regurgitate these to answer examination questions. There 
would be no need to communicate with others, not even fellow mathematicians. 
No need to be able to describe in layman’s terms a piece of mathematics. 
No need to write precisely and understandably. Yet it is these primarily non- 
mathematical skills that are required in interview situations and in situations 
where you need to convince others of your worth. There are now many more 
graduates, with the number edging towards 50% of the population in the UK. 
Therefore it is essential that all students have as part of their degree something 
that prepares them for the real world, and for that all important job interview. 
Not just the interview of course. Once successfully past the interview stage, 
these qualities also form important employment skills. Of course, a good degree 
is one mark of distinction, but an unusual project or case study is icing on the 
cake as it gives you something extra to enthuse about at an interview or in 
discussions with prospective employers. 

A detailed discussion of assessment is covered in the next chapter. There 
are various types of projects and case studies which are now briefly described 
in terms of their organisation. It is expected that with experience and with the 
changing nature of higher education and associated technology, other forms of
project and case study will emerge that combine some of the features outlined
below as well as incorporate new ones. 



1.1 Individual Projects 

The essence of an individual project is that it is a piece of work completed by 
just one person. The advantages of this are obvious. The topic can be geared to 
the desires of an individual, special interests can be catered for, you can work 
at your own pace and on completion there is a project report that you can 
be proud of that is all your own work. There are, sadly, some disadvantages 
too. Overworked academics can be hard to find for advice, especially given the 
larger student/staff ratios, all too prevalent these days in the continual drive for 
greater efficiency. Individual projects can be a lonely enterprise; it is difficult 
to share triumphs and disasters with fellow students, there is no opportunity 
for co-operation as there might be in tutorial exercises or other coursework. 
However even with these disadvantages the individual project is now a common 
and popular part of the undergraduate mathematics curriculum. 

The individual project is a firm part of most final year degrees in other 
subjects. Some would say that it is arguably the most important part. In
engineering, the degree would not be accredited without it, for it gives the
opportunity for the student to show understanding and flair as well as acting as the
focus for many of the topics covered in the formal lectures. It has assumed par- 
ticular importance these days as modular courses are now common, and these 
can lead to an over-compartmentalised curriculum. The project then becomes 
a vital glue for the final year modules. In vocational subjects it can also relate 
the curriculum more closely to the world of work and help the student to get 
that all important first job. These statements are also true for mathematics. 
There is no doubt that many a final year project has helped to steer a student 
into a particular career. In a degree that has a sandwich year, which is a year 
spent in industry usually just prior to the final year, the final year individual 
project can be successfully used to integrate this experience into the degree. A 
successful sandwich year can also lead to that all important first job. 

Individual projects in any year other than the final year are rare, and in my
experience unknown in mathematics. They are not considered here.

Sometime towards the end of the second year of a full time mathematics 
degree that contains an individual project, a handout will normally be issued to 
all students. This can be in the form of a list of available staff and their interests, 
or it can simply be a list of available projects. My personal preference is to 
give the student some flexibility, but also some guidelines in terms of general 
topics available. A project too well defined begins to resemble just another
lecture course: on the other hand a project designed completely by the student 
is invariably over-ambitious. In Plymouth I would be rich if I had £10 every 
time a student has approached me with a desire to do “something to do with 
sailing or windsurfing”! The important point is for you to discuss potential
projects with one or more potential project supervisors. Ideally you will have 
chosen the project quickly enough to be able to do preliminary work over the 
summer break, at least theoretically! However, this is not essential and may 
be impractical as many of you are far away indeed from relevant reference 
material, computers or even a pencil in the summer months. One final but 
important point about year long projects: some staff need to be reminded 
that the project is undergraduate and that it lasts for less than a year and 
is (usually) less than 20% of the curriculum. It is not postgraduate and it is 
definitely not an MSc or a PhD! Your personal tutor is a good choice to issue 
this reminder: if your project supervisor and personal tutor are one and the 
same try another sympathetic ear in the Department, going to the Head of 
Department if necessary. Assessing individual projects forms part of the next 
chapter, and examples of projects are given in Chapter 3. 

Sometimes individual projects last less than a whole academic year. In fact 
it is not rare for a project to last one semester which is half an academic year. 
In this case, the work has to be telescoped into half the usual duration. The 
above advice still applies, with the comment that it is important to remember 
the reduced time available. The subject of the project must be chosen promptly, 
and help and guidance with regard to the management of the project needs to 
happen within the first few weeks of commencement. Normally, supervisor and 
student meet weekly to monitor progress, and it is always important to make 
these contacts purposeful, no more so than for a project that only lasts half a 
year. Whatever the duration, it is important that students have a quiet place to work which
is free from distractions when writing their project. If your digs or student 
room is noisy, explore the possibility of booking somewhere in the library, or 
it might be possible to get a small room somewhere on campus set aside for 
project students. 

Some courses offer “mini-projects”. These will be short pieces of work that
last two or three weeks maximum. Each student is set a task to do within this 
short time span and at the end submits a report of five to eight pages. This 
only differs from standard coursework because each student is set or chooses 
something different. The report is the only assessment, and it is usually worth 
no more than 20% of the module. This kind of mini-project is not explored 
here. 

In the examples of Chapter 3, attention will be restricted to the year long 
and half year projects. 
1.2 Group Projects

Group projects are perhaps less common, and are certainly very rare in mathe- 
matics. There are many reasons for this, the most apparent being the difficulty 
in assessing individual contributions to group projects with accuracy. The whole 
question of assessment will be addressed in the next chapter. 

Nevertheless group projects do exist and for at least one good reason. They 
provide an excellent vehicle for co-operative work. Mathematics is not renowned 
for encouraging work in groups: almost uniquely mathematicians can be very 
successful working by themselves in total isolation. Arguably the best and most 
innovative work is still done this way. A recent example is the proof of Fermat’s 
Last Theorem by Andrew Wiles. However ordinary mortals who are the bulk 
of students reading mathematics will normally end up working alongside other 
professionals (who will not be mathematicians but perhaps engineers, scientists, 
business graduates or graduates in one of the humanities) in commerce or 
industry. The group project thus provides a valuable introduction to working 
and getting along with others. Interesting questions are how big to make the 
group, and how to choose who is in each group. Groups of three, four or five 
are probably ideal. I have also worked with a group of two and a group of 
six. The group of two was quite successful, but the scope of the project was 
limited: many hands make light work! On the other hand a large group can 
result in the quieter members being overlooked and partially squeezed out of 
the final group project report. This could result in the student being unsatisfied 
with the experience: too many cooks spoil the broth perhaps! Some lecturers 
allow groups to form naturally. This has the merits of giving the student the 
opportunity (and responsibility!) for the make up of the group but can lead 
to an uneven spread of ability. All the good students in one group is less of a 
problem than having a group of struggling students. On the other hand, forcing 
students into a group could be counter-productive and merely lead to disputes 
and non-participation. My own preference is to allow the students to organise 
themselves into groups for group projects. It is important in group projects for 
the tutor to play a very active role at the beginning. Students would usually 
be unfamiliar with the details of a group project, so a handout such as the one 
shown would be a good idea. 

Group Project

Organisation

A set of students, minimum number 2, maximum number 6, preferred number 3-5, work as a
team on a project. This is called a Group Project.

At the beginning of the module, there will be a short sequence of lectures explaining each
project. At the end of the lectures, students will be expected to choose one of them to work
on. This does not preclude students from coming up with their own ideas for projects, but
any ideas must be vetted for suitability by the module leader, and there must be a group
of students willing to work together on it. In practice each project consists of a number of
topics that individuals work on. There is a handout accompanying each project that includes
suggested topics; however, these are only suggestions and students working on the project are at
liberty to formulate their own strategies for completing the project. Again, the module leader
is there to help and provide guidance on, for example, equalising workload within a project
group. The goal is to produce a project report by the end of the period to which all the group
has contributed more or less equally, and with which all members of the group are proud to be
associated.

The module is worth 10 credits (there are 120 credits in the year) so a rough guide is that each
student should spend 100 hours on their contribution.

After the initial lectures, formal class contact is restricted to one hour per week when the
module leader is available in the classroom for general consultancy, making general points from
matters that arise, giving advice, etc. There are two other timetabled hours in the week when
the module leader is available in his/her office. These should be used for specific queries, usually
but not necessarily of a technical nature.

The choice of who does a particular project is largely up to you. You will be asked to state
preferences, and these will form the basis of the split into groups. Note that it is perfectly
possible for more than one group to work on the same project. Usually the projects have
enough scope for many groups. The module leader does have the final say as it may be in your
own interest to ensure that the composition of each group has roughly equal mathematical
ability.

Assessment

The assessment of this module will be as follows:

1. Contribution to final report [hand in deadline]       70%
2. Interim Report [hand in deadline]                     10%
3. Verbal report to the whole class [date of orals]      10%
4. Peer assessment by other members of the group         10%

The square brackets after each assessment item would give the actual date
these were due. The module leader would distribute this handout to the whole
class. This would be followed by an introduction to each group project topic
with perhaps a forty minute run through outlining the type of mathematics that
might be contained in each project together with some ideas on the possible
tasks each member of the group may undertake. Some discussion would then
take place about which were likely to run, whether there were any project ideas
from the student body, and some further discussion on peer assessment. This
latter topic may be the subject of a further handout (see the next chapter). All
this is to give the students enough information to enable them to make up their
minds. Some specific examples of group projects are given in Chapter 4.
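
As an illustration of how the four assessment items in the handout might combine into a single module mark, here is a minimal Python sketch. It assumes each component is marked out of 100 and that the components are combined as a straight weighted sum; the function name, the component names and the example marks are hypothetical and are not taken from any actual module.

```python
# Weights (percent of the module mark) as listed in the group project handout.
WEIGHTS = {
    "final_report": 70,
    "interim_report": 10,
    "verbal_report": 10,
    "peer_assessment": 10,
}


def module_mark(marks):
    """Combine component marks (each out of 100) into a module mark out of 100.

    Assumes a plain weighted sum; a real module may moderate or scale marks.
    """
    if set(marks) != set(WEIGHTS):
        raise ValueError("marks must cover exactly the four assessed components")
    return sum(WEIGHTS[name] * marks[name] for name in WEIGHTS) / 100


# Hypothetical marks for one student.
example = {
    "final_report": 62,
    "interim_report": 70,
    "verbal_report": 55,
    "peer_assessment": 80,
}
print(module_mark(example))  # 63.9
```

The final report dominates the result, so a strong interim report or presentation shifts the module mark by only a few points.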
1.3 Case Studies

The case study varies in concept and execution. One possibility is that the 
case studies module of a mathematics degree consists of formal lectures and 
assessments much like any traditional module on, say, differential equations. 
Each case study is perhaps spread over two or three lectures and students 
are assessed on their understanding of it either then and there with a test or 
coursework, or at the end of the module via an examination. This kind of case 
study module is not all that different from a standard lecture course. There is a 
kind of case study module that is distinctive which might run as follows. A series 
of lectures is given at the beginning of the academic period (term or semester). 
These are a mixture of the technical and the managerial. Technical lectures 
introduce each case study whilst the management lectures tell the students 
about how they will be assessed and how they will interact with the rest of the 
group. Typically each case study is very different: one may have arisen from an 
industrial problem whereas another may be research paper based, and a third 
can be a mathematical topic which is not new to mathematicians, but is new to 
the students and different from the rest of their course. Some may even involve 
a group of students working off campus with confidential data! This however 
is rare in mathematics. 

The case study can be run in a similar way to a group project, but here 
there is a stronger case for choosing groups for the first case study then rotating 
the membership for a new one. For both case studies and group projects, there 
may be student concern over equality of workload. For example those in a 
group whose project concerns analytical dynamics or quantum mechanics may 
feel that their project is harder than one on graphs and networks. My view 
is that of course all projects and case studies are different and therefore some 
are harder than others; however the assessment is adjusted accordingly. The 
potentially serious problem is where there is one very strong student in a group 
of less able students. This student could end up doing most of the work on 
the one hand, or not achieving his or her potential on the other. Some of these 
difficulties are faced when we consider assessment in the next chapter. 
Assessment



2.1 Introduction 

Assessment is the most important aspect of education to get right at all times, 
but especially when running a project or case study. These days students are 
not backward in telling you when things are going wrong, and if it involves their 
marks so much the worse. It is also true that now it seems more difficult to get 
students to do anything out of altruism than it used to be. It was once the case 
that students would expect “homework” to be set regularly, be marked and 
returned regularly. This was seen to be a standard part of any lecture course 
in mathematics. Nowadays, if homework is set, too many students will not do
it, even if this is accompanied by an apology and a Harrison Ford-like lopsided
grin. Despite it being helpful, a fact often duly recognised by the students 
themselves, the fact that it is not essential means that it does not get done. Or 
more accurately homework tends not to get done by those who would benefit 
most by doing it. Although educational theorists still talk about formative and 
summative assessment, in practice the distinction between the two is often 
fuzzy. After the first few weeks, everything students do usually needs to be 
connected to a mark that contributes to the final grade. Or, at the very least, 
be seen to be extremely useful towards gaining marks. In this chapter we tackle 
the all important question of how to assess individual projects, group projects 
and case studies, although it is the first two that take the lion’s share of the 
chapter. 

An important aspect is to make sure the student is committed quickly after 
the start of the module. One way of ensuring this is to set an assessment quite
near the beginning. Such an assessment need not be particularly summative, 
i.e. it need not contribute much toward the final grade, just enough to convince 
recalcitrant students to do something. It is largely formative and is there to 
help the student to know that he or she is on the right lines. We call this assessment the
“interim report”. 



2.2 Interim Reports 

Interim reports are found in group projects and in individual projects. Gener- 
ally they are absent from case studies. The group project, as the name implies, 
is a group of students working towards the production of a common piece of 
work. On the other hand the individual project is done by one person, with 
greater or lesser assistance from the supervisor. The role of the interim report 
is therefore subtly different in each case so let us discuss them separately. 

To begin with individual project interim reports, these are primarily an 
indication that progress is being made with the project. Advice on how to 
write them is postponed until the next chapter. Here we shall concentrate 
solely on assessment. The supervisor should not be surprised by its contents, 
but it provides the student with an opportunity to review progress. It should 
also be assessed by another member of staff who can provide an independent 
view. The contents can be technical, outlining the mathematics done so far and 
signalling future work. Or it can be a general overview, describing what has 
been done. It is largely formative, but it is a good idea for it to attract some 
mark which is worth up to 10% of the total project mark. This mark should 
be the average given by the two assessors. 
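
The arithmetic just described can be sketched in a few lines of Python. This is an illustration only: it assumes both assessors mark out of 100 and that the interim report carries the full 10% suggested above; the function and parameter names are hypothetical.

```python
def interim_contribution(first_mark, second_mark, weight_percent=10):
    """Average two assessors' marks (each out of 100) and scale the result
    to the interim report's share of the total project mark."""
    for mark in (first_mark, second_mark):
        if not 0 <= mark <= 100:
            raise ValueError("marks are assumed to be percentages")
    average = (first_mark + second_mark) / 2
    return average * weight_percent / 100


# Supervisor awards 64, second marker awards 72: the average is 68,
# which contributes 6.8 points towards the total project mark.
print(interim_contribution(64, 72))  # 6.8
```

Passing a different `weight_percent` covers cases where the interim report is given a larger or smaller share of the total.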

Group project interim reports are a little different. They are individual 
efforts, perhaps the only part of the group project that falls into this category 
(but see the next section on verbal presentations). Once more, guidance on how 
to write an interim group project report is found in Chapter 4, although it is
worth looking at the guidance in Chapter 3 too as there are many similarities.
The purpose of the interim report is still to assess progress so far, but it is also 
to help each student to identify his or her role within the group. A group project 
is based on the collective work of a group of students. In any particular group 
there may be dominance. One student may be physically dominant through 
appearance, a booming voice, a domineering attitude, etc. More likely one 
student could be intellectually superior, the boffin of the class perhaps. Another 
possibility is that the group consists of a “clique” plus an outsider. It is all too 
easy for this outsider to feel ostracised and leave the project for the rest to do. 
In all of these cases, circumstances could lead to one or more members of the
group not contributing sufficiently. The interim report forces every member of 
a group to home in on a sub-topic, do some work on it and write up what they 
have done. It also gives the module leader an indication of what each member 
of a group has done and will do, hence providing a valuable oversight of the 
entire project. Each interim report should still be second marked, although 
the final mark in this case could be agreed (over coffee perhaps) rather than 
a straight arithmetic mean. The mark given to the interim report here could 
be 15% rather than 10% to emphasise its importance. It is very important in 
both cases to give positive feedback to each student. At this early stage it is 
all too easy for students to become disheartened about the project or to go off 
at a tangent, or to get stuck. The feedback from the interim report is a useful 
vehicle to help the student formally. More seriously, if there is a severe problem 
later (for example health related), evidence from this stage can be very helpful. 



2.3 Verbal Presentations 

These days it is important for every student to get the opportunity to give 
a presentation. The era of the bright graduate in mathematics or engineering 
with a first class degree who can only mumble incoherently in interviews or 
meetings has passed. Being good on paper and in passing examinations is now 
only part of the story. All forms of project and case study can provide good 
vehicles for students to get used to presenting to others. Let us run through 
different ways this can be done and outline assessment procedures. 

The most obvious, and the most daunting for the student, is to prepare a 
talk for the whole class. This is most often done in the context of an individual 
project where the student is given the opportunity either half-way through the 
year or at submission time to tell everyone about it. It has the advantage of 
being something that is new to the bulk of the audience (fellow students) and 
the staff present are normally sympathetic and do not ask too many awkward 
questions at the end. Nevertheless it is an ordeal for most students, particularly 
if they have not done anything like it before. The secret is to prepare well and 
run through the material a number of times before the event. Use of visual aids 
is encouraged. These used to be overhead projector transparencies written on 
or containing photocopied writing and equations, but increasingly now include 
PowerPoint presentations or computer projections and video clips. Here is a 
short list that students will find useful. 

Figure 2.2 Staff are normally sympathetic and do not ask awkward questions. (Cartoon caption: “Just run through Wiles' corrected proof of Fermat's Last Theorem for me, would you?”) 

In a good formal presentation the speaker should: 

1. Await formal instructions from the Chairman or Announcer. 

2. Thank the Chairman or Announcer. 

3. Give a proper formal introduction to the talk in the form “I am .... from 
.... and my talk is on ....”. 

4. Always address the audience. 

5. Keep the audience interested. 

6. Give an outline of the talk at the start by going through a list of contents. 

7. Present material in a logical, structured manner. 

8. Use appropriate visual aids, usually overhead projector transparencies or 
PowerPoint. 

9. Make sure the information on visual aids is clear and readable, and make 
sure they are not overcrowded. 

10. Keep within the allotted time. 

11. Conclude the talk properly (e.g. run through a list of conclusions). 

12. Thank the Chairman and audience for their attention. 

13. Respond clearly, concisely, correctly and politely to questions. 

The presentation skills are of course important, but what must be right is the 
technical content. Students should remember that lecturers can see through the 
most glamorous presentation and soon detect any flaws or misunderstandings 
in the material of the project. The assessment of such project talks is done 
against two principal criteria: the mathematical content and technical level of 
the talk, and presentation skills. The weighting is either 50% each or 60%:40% 
in favour of the content. There may be a heavier weighting in favour of content, 
but the presentation side should not be completely ignored. Again, these 
days verbal presentations are double marked to ensure fairness and quality. 
The verbal presentation is usually about 10% of the total individual project 
mark. Normally, students find the experience of standing and presenting very 
nerve-racking, and it is quite useful to be given some general guidelines on how 
performance relates to marks. The following list matches performance to grade: 

Classification   Presentation 

First            Clear, insightful, good answers to probing questions. 
Upper Second     Clear explanations, able to answer questions. 
Lower Second     Some good explanations and answers, but sometimes 
                 unclear or unconvincing. 
Third            Acceptable explanations of part of project, partial 
                 answers to some questions, but otherwise confused. 
Fail             Completely muddled, or missed the point, or did not 
                 turn up! 

Figure 2.3 Students should keep within the allotted time. 
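
The weighting of content against presentation skills described above can be illustrated with a short calculation. What follows is only a sketch: the function names, the default 60%:40% split and the sample marks are assumptions chosen for illustration, and each department will have its own scheme. 

```python
def talk_mark(content: float, presentation: float,
              content_weight: float = 0.6) -> float:
    """Combine the two criteria for a project talk into one mark.

    content, presentation: marks out of 100 for each criterion.
    content_weight: fraction given to content (0.5 or 0.6 are the
    values mentioned in the text; the default here is an assumption).
    """
    if not 0.0 <= content_weight <= 1.0:
        raise ValueError("content_weight must lie between 0 and 1")
    return content_weight * content + (1.0 - content_weight) * presentation


def contribution_to_project(mark: float, share: float = 0.10) -> float:
    """Percentage points the talk adds to the overall project mark.

    share: fraction of the project mark carried by the talk
    (about 10% in the text).
    """
    return share * mark


if __name__ == "__main__":
    # Hypothetical student: sound mathematics, weaker delivery.
    combined = talk_mark(content=70, presentation=55, content_weight=0.6)
    print(f"Talk mark: {combined:.1f}%")
    print(f"Contribution to project: {contribution_to_project(combined):.1f} points")
```

With a 10% share the talk moves the overall project mark by only a few points either way, which may reassure the nervous presenter. 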

Students do not need convincing of the usefulness of giving a verbal presen- 
tation, both from the point of view of learning the material of their project 
and also in instilling confidence in situations such as the job interview. Many 
interviews these days include an element of presentation and students who have 
done this kind of thing before start with a great advantage. 

In group projects, there is normally one group project presentation, but 
all members of the group must participate. This can be all group members 
taking a turn at the OHP, but it can be more adventurous. For example, a 
simulated interview, or one student taking on the role of Master of Ceremonies 
and introducing the rest in turn. This can present difficulties if the amount of 
time each member of the group is at the podium, so to speak, is very different. 
The assessor should take account only of the contribution in terms of technical 
content and presentation and this ought to be independent of the actual number 
of minutes it takes. The only time the student is penalised is if the presentation 
is far too long due to lack of discipline or organisation of material, or far too 
short due to ill-preparedness. Whatever format is used and however long each 
contribution takes, it is possible to give each participant an individual mark 
although inevitably there is less differentiation between content marks than 
between presentation marks. 

There are other forms of verbal presentation that are well worth trying. 
One is the viva-voce, or viva for short. This is best suited to the individual 
project, but could be tried elsewhere. Normally the student is questioned for 
about fifteen minutes. The questioning is done by the assessor who may or may 
not be the project supervisor. In fact, an arrangement that works well is if the 
questions are led by an independent assessor with the project supervisor taking 
the role of the informed assessor, mostly listening but chipping in the odd 
question. It is important that the supervisor is involved in assessing as a final 
year student often does not do justice to his or her knowledge in the stressful 
situation of a viva. The presence of the supervisor normally acts to calm the 
student, although of course one can always think of exceptions! This contrasts 
with the PhD viva in which the supervisor is definitely “prisoner’s friend” and 
normally takes no role as assessor. Some postgraduate research students elect 
not to have their supervisor present: this has not been my experience for undergraduate 
student projects. It is important that there are two assessors for the 
project, and in the unlikely event that the student objects to one of these being 
his or her supervisor, two others have to be appointed. There is no point arguing with 
the student about this; however it must be made clear that there are severe 
disadvantages in not allowing the supervisor to assess the project. For example 
the two people assessing the project may be unfamiliar with some of the circumstances 
of its production and not give credit at the correct weighting for 
work done. The assessment in this case is unlikely to be accurate. On the other 
hand to force a student to include an assessor/supervisor who, for whatever 
reason, is biased is to invite an appeal. Appeals are messy and best avoided. At 
the close of the viva, the student leaves the room and the two assessors agree 
a mark. This mark could be for the whole project and not just the student’s 
performance at the viva. Indeed it is important that each student knows the 
criteria under which the viva is marked. Usually it is presentation skills and 
depth of knowledge shown. It sometimes happens that a student breaks down 
under vicious and over-penetrating questioning. This is of course most unfortunate 
and may be the fault of an overzealous assessor who usually bears no 
malice but gets caught up in the subject matter of the project to such an extent 
that he or she forgets that it is only an undergraduate student on the receiving end 
of questions and not an eminent professor. The supervisor should spot when 
this is happening and help the student, but if this does not happen there could 
once again be grounds for appeal. It is useful for a passive third assessor to be 
present in case of disputes. Finally of course telephone calls, knocks on the door 
and other external interference need to be eliminated as with any confidential 
meeting. I once conducted an oral examination with a raging storm outside 
which, with only partial success, we all tried to ignore! 

Normally the viva is not marked separately and if it is, it is only given up 
to 10% of the marks. Oral presentations on the other hand are usually marked 
separately, again usually attracting 10% of the marks as a maximum. 

Figure 2.4 The presence of the supervisor normally acts to calm the student. 



2.4 Final Report 

The actual writing of the report for the individual project is a subject tackled 
in the next chapter. Here we concentrate on its assessment. 

Quite naturally, the final report is the ultimate goal and provides the main 
criterion by which to judge the success or otherwise of the entire venture. If 
more than one student is involved in producing the project report, then it is 
usually invidious to try and mark each contribution separately. Far better to 
give a single mark for the whole written report and use the other assessments 
to differentiate between students. 

Some attention is now given to developing the criteria needed to arrive at a 
mark. To read a large piece of written work and then to decide “this is worth 
x%” is really not satisfactory and can leave academic staff open to student 
appeal. One way to be more helpful is to tell students how to get high marks 
in the written project. Here is a typical list: 



Classification   Comment 

First            Elegant, excellent English, well structured, 
                 very few typographical errors. 
Upper Second     Concise, good English, clear structure, 
                 not too many typographical errors. 
Lower Second     Acceptable English, some structure, 
                 not very well proof-read, rambling prose. 
Third            Some acceptable writing, some evidence 
                 of planning, many errors, muddled. 
Fail             Illiterate and confused. 




It is emphasised that these are guidelines and that, for example, not all third 
class projects will be muddled. There will be some guidance also about the 
amount of time that should be spent on the project. Ten hours for every credit 
point is consistent with 120 credit points, or 1200 hours, every academic year. Since 1200 hours 
equates to 400 hours in each of three terms, and given a 10 week term, this implies a 40 
hour week which is about right. More specific indications of attitude are given 
in the following list: 

Classification   Typical commitment 

First            Enthusiastic, worked largely independently 
                 of supervisor, found own sources. 
Upper Second     Found some extra references, 
                 diligent and well read. 
Lower Second     Read the given texts thoroughly. 
Third            Read the given texts, but was 
                 uncritical and sluggish. 
Fail             Did not find or read 
                 anything relevant. 
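
The workload arithmetic quoted before this list can be checked in a few lines. This is a minimal sketch that reads the ten-hour figure as applying to each credit point (an interpretation of the text) and assumes three terms a year; the 20-credit project at the end is a hypothetical example, not a figure from any particular regulation. 

```python
HOURS_PER_CREDIT = 10    # ten notional hours per credit point (interpretation)
CREDITS_PER_YEAR = 120   # credit points in one academic year
TERMS_PER_YEAR = 3       # assumption: 1200 hours split into three terms
WEEKS_PER_TERM = 10      # ten week term, as in the text


def hours_per_year() -> int:
    """Total notional study hours in one academic year."""
    return HOURS_PER_CREDIT * CREDITS_PER_YEAR


def hours_per_term() -> int:
    """Notional hours in each term, assuming equal terms."""
    return hours_per_year() // TERMS_PER_YEAR


def hours_per_week() -> int:
    """Notional hours in each teaching week."""
    return hours_per_term() // WEEKS_PER_TERM


def project_hours(credits: int) -> int:
    """Notional hours for a project carrying the given number of credits."""
    return HOURS_PER_CREDIT * credits


if __name__ == "__main__":
    print("Hours per year:", hours_per_year())
    print("Hours per term:", hours_per_term())
    print("Hours per week:", hours_per_week())
    # Hypothetical 20-credit project.
    print("Hours for a 20-credit project:", project_hours(20))
```

On these assumptions a 20-credit project represents some 200 hours of work, which is a useful figure to keep in mind when planning the writing timetable. 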



So far nothing has been said about mathematical accuracy. A very well struc- 
tured report, well written, thoroughly referenced but full of mathematical errors 
will not get good marks! Obviously it is also true that projects vary as to their 
mathematical sophistication. A student who is struggling with the technical 
aspects of mathematics is unlikely to do well in an individual project such as 
the hypergeometric functions project outlined in Section 3.7. However a project 
which is less mathematically demanding may well provide a good vehicle. The 
thorny question about levels of difficulty then arises. Is it permissible for a 
student to undertake a project with, in the extreme, no mathematical con- 
tent? Perhaps the biography of a mathematician? The consensus seems to be 
that a certain level of mathematics is essential. For example, mathematical 
biographies, in addition to being scholarly need to contain a reasonably de- 
tailed account of the mathematical breakthroughs made by the subject of the 
biography. If the biography is of an ancient mathematician, say Archimedes 
or Apollonius then the report needs to contain a substantial amount of the 
mathematics of finding the area of a parabola using the method of indivisibles, 
or the geometry of spirals or whatever. An essay on the sociology of the time 
and some lamentation over all the lost Greek primary sources will not do. Then 
there are the projects that emerge from the general area termed Mathemat- 
ics Education. Judging the quality of a project based around the impact of 
calculators on ‘A’ level mathematics against that on hypergeometric functions 
remains very difficult and the subject of some heated debates. The best advice 
to students is to do some advanced mathematics if at all possible or risk a poor 
project mark. 

Copying has always been a problem to some extent, but in these days of the 
internet and floppy disc, it is so much easier to do and harder to detect. There 
are cases of completely plagiarised PhD theses that have gone undetected for 
years, and I am sure this is also true at undergraduate level. Most universities 
have in place codes of conduct and severe disciplinary rules which prevent 
blatant plagiarism. However, the line between research and copying is a fine one 
especially at undergraduate level. Students must reveal all sources and include 
them in the reference list, even if this reference is a past project by an ex-student 
from a few years back. Building on the work of a previous undergraduate is 
commendable: copying from a previous project is unforgivable. 

Although the emphasis above has been on assessing the written individual 
project report, much of it applies equally well to assessing any project report. 
The only additional aspect of a group report is assessing the integration of the 
different contributions. If this is well done, and an external examiner finds it 
hard to detect where one contribution ends and the next starts then credit 
should certainly be given. If it is entirely obvious that the report has been 
written by, say four individuals, then I am not convinced that any penalty 
should be exacted. On the other hand if each part is done in different type size 
and font, or there is discontinuity in page numbers (or no page numbers at all), 
or say two Figure 8’s this detracts from the quality of the report and the mark 
must reflect this. 



2.5 Moderating 

Moderating is the name given to checking that all marks are fair and reason- 
able. In projects, this is done in several ways. In an individual project, it is 
a good idea for there to be a second marker for the project. Both attend any 
seminar or oral presentation involving the student, and both read and mark the 
project. The two assessors then agree on a mark, and this becomes the mark 
submitted to the examination board. In the event of disputes, some institutions 
involve a third assessor who acts as a referee. Each assessor writes a report on 
the project, and justifies the mark awarded. If the two grades are widely dif- 
ferent, these reports should say why. A third assessor then brings judgement 
to bear and either agrees with one or other of the assessors, or strikes a happy 
medium. The third assessor’s ruling should be final. If both assessors make sim- 
ilar comments, but award different marks, for example: “superb piece of work” 
50% and “superb piece of work” 70% then the criteria given in the section 
above should be used to remind the first of these assessors that a “superb piece 
of work” equates to first class honours and ought to be awarded 70% rather 
than 50%. It is also common for one person, usually the final year tutor or 
equivalent to read all the individual project reports and check the comparative 
fairness of the marks. After this has been done, there is often a short (!) meeting 
at which minor adjustments can be made before marks are finally submitted 
into the administrative system. Most academic staff are willing to concede that 
marking projects cannot be done to within an accuracy of three percent, and 
swapping the order of two projects between say 68% and 71% on the strength 
of the recommendation of the final year project coordinator’s re-reading of both 
projects is normally not controversial. 
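
The double marking procedure described above can be sketched as a simple check. The sketch rests on assumptions that should be stated plainly: the classification boundaries of 70, 60, 50 and 40 are the conventional UK ones (the text itself only ties 70% to first class honours), the 10 point tolerance for "widely different" marks is hypothetical, and the mean of two close marks is offered only as a starting point for the discussion between assessors, never as a substitute for it. 

```python
from typing import Optional

# Conventional UK honours boundaries (assumption; check local regulations).
BOUNDARIES = [
    (70, "First"),
    (60, "Upper Second"),
    (50, "Lower Second"),
    (40, "Third"),
]


def classify(mark: float) -> str:
    """Map a percentage mark to a degree classification."""
    for threshold, label in BOUNDARIES:
        if mark >= threshold:
            return label
    return "Fail"


def needs_third_assessor(first: float, second: float,
                         tolerance: float = 10.0) -> bool:
    """True if the two marks differ by more than the (hypothetical) tolerance."""
    return abs(first - second) > tolerance


def provisional_mark(first: float, second: float,
                     tolerance: float = 10.0) -> Optional[float]:
    """Suggest a starting point for the agreed mark.

    Returns the mean when the marks are close, or None when the gap
    is wide enough that a third assessor should be brought in.
    """
    if needs_third_assessor(first, second, tolerance):
        return None
    return (first + second) / 2


if __name__ == "__main__":
    # The "superb piece of work" example: 50% against 70%.
    print(classify(50), "versus", classify(70))
    print("Refer to third assessor:", needs_third_assessor(50, 70))
    # Two close marks of the kind mentioned in the text.
    print("Starting point for discussion:", provisional_mark(68, 71))
```

A check of this kind flags the 50% against 70% disagreement immediately, since the two marks fall in different classifications even though the comments are identical. 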

Group projects are moderated by another member of staff overseeing all the 
assessment procedures. The moderator is involved in attending oral presentations 
and in reading the interim and final group reports. There is no involvement 
in peer assessment other than being aware of (and commenting on, if necessary) 
the procedure used. 



2.6 Assessment of Case Studies 

Some universities and colleges use case studies to mean group projects, in which 
case the above means of assessing group projects applies. Here a case study is 
defined as the presentation of an example of the use of mathematics to solve 
a specific problem, or a new application of mathematics. Normally it will span 
three to five hours of lectures. Students are thus attending lectures and tutorials 
in much the same way as in a standard module. At first sight therefore, there 
seems little reason to assess case studies in a manner different from a standard 
lecture course, that is, simply set an examination at the end and support this 
with some coursework, the weighting being perhaps 80% on the examination 
and 20% on the coursework. It is however possible to be more adventurous. 
Since a case study is an in-depth look at a particular mathematical technique, 
or some (to the student) new branch of mathematics it might be possible to 
set extended coursework with students working in pairs. Assessment can then 
involve some elements similar to the group project: verbal presentation and 
a common report. There should still be an examination as this is the fairest 
way of assessing the understanding of mathematics that has been given to 
the whole class. For case studies, an open book examination is particularly 
suitable, although in my personal experience there should be some restriction 
on the volume of material allowed into the examination room. 

Figure 2.5 There should be some limit on the volume of material allowed 
into an open book examination. 




3 Individual Projects 



3.1 Introduction 

This chapter is unashamedly directed at you the student. It is designed to 
help you through tackling the individual project that often forms part of the 
final year of a mathematics degree. Here some general aspects of undertaking an 
individual project are introduced and several examples of projects are discussed. 
The project examples will encompass a fair spread of mathematics, but that 
is not really the point. This is not a chapter on introducing new mathematics; 
instead the mathematics of each project example is used to amplify aspects of 
the process you go through in order to write a successful project. The form 
the chapter takes is as follows. There are introductory and general sections on 
starting up and writing a project, then there are the project examples. These 
examples are taken from real projects that ran in the University of Plymouth, 
and there is extensive quoting from each example project. Alongside this, there 
is a commentary on aspects such as relevant advice, technical matters, and 
assessment. 

For many students the hardest part of a project is not the mathematics, 
although this of course should be challenging and at the correct level. No, it is 
learning how to write a report that is coherent and well structured, and that uses good 
English as well as correct mathematics, that provides the challenge. Mathematics 
students are not renowned for their expertise in writing: for some this reputation 
is harsh but nevertheless there remains a core of students who pursued 
mathematics because of their inability to string words together meaningfully. 
These days it is not acceptable for graduates to lack transferable skills, and 
writing is certainly one of these. What better way to get a recalcitrant mathematics 
student to write than to base a project around a favourite mathematical 
topic? There is more on these writing skills in Section 3.3. Aside from the actual 
English there is also readability and structure. It is important to give every stu- 
dent the opportunity to do exciting mathematics of course, but features such 
as telling a story, giving some biographical detail (where appropriate), spelling 
out limitations, etc., are equally important. In writing up a project you are 
putting mathematics in some context and presenting the whole in a coherent 
package for the intelligent layman. It is emphasised that all the examples here 
are from real projects that formed part of the final year of a degree course in 
mathematics. In fact they all formed one sixth of the assessment of the final 
year. It is useful for you to know and the author is pleased to report that they 
all succeeded. As in studies of pollution or the spread of disease, it is not ac- 
ceptable to engineer a deliberate project failure in order to have an example 
of one! The isolated examples of project failures this particular academic has 
encountered have all been due to the student opting out. Often this can easily 
be prevented, so it is particularly sad but thankfully rare when it happens. 
This is why it is so important to engage the student early and have in place 
various checks that monitor progress and that enable the lecturer to spot the 
first signs of things going wrong. Who can blame a student for “opting out” of 
a project when the subject matter, although glamorous, turns out to be totally 
beyond their mathematical ability, or when they are left totally isolated with a 
supervisor who is never there? The projects featured here were graded as follows: 
one lower second, one upper second and two firsts. Details on the assessment of 
each project follow each description. Three of the examples are taken from the 
efforts of my own project students. This enables me to be more forthcoming 
about these assessments. 

Mention needs to be made of the role of computing in a mathematics project. 
A common view is that computing should be regarded as a tool. It is a very 
powerful tool and facilitates numerical analysis. It is quite legitimate for a 
student to spend a lot of project time writing computer programs as long as 
these are a means to an end and not an end in themselves. For example, a 
FORTRAN program that solves a boundary value problem numerically is a 
legitimate part of a project. The student (and project supervisor!) must guard 
against spending all the time honing the program, making it super-efficient 
or providing glamorous output at the expense of correct modelling. This is 
particularly so for the less mathematically gifted student who is working on 
a project without demanding mathematics. One example might be web based 
material for solving differential equations as an educational tool, undertaken by 
a student who is pursuing a joint degree of Mathematics with Education. The 
evaluation of the output from such a project must be in terms of its educational 
value as a tool for learning mathematics and not in terms of the computing 
design itself. Of course, the downloading of unreferenced material from the 
internet is as unacceptable in a project as anywhere else. The experienced 
lecturer can almost always detect where this is happening (changes of style and 
the sudden onset of excellence!). Some mention of the difficulties of assessing 
Mathematics Education projects was made in the last chapter. Of course good 
design is related to a good educational product but the emphasis must be on 
the latter in a mathematics degree. 

If you are following a joint degree in mathematics and computing say, then a 
project that combines both disciplines is a good idea. The danger with all joint 
degrees is that the student is serving two masters with different agendas. It is 
particularly important to write down in the form of a short description or using 
bullet points what you are expected to do in this case. In the examples that 
follow, this is always done. There is no computing oriented project example 
included here. One is pure mathematics, two are applied mathematics and one 
is mathematical methods but containing a great deal of analysis. In no way is 
it possible to cover all the possible designs of projects. Other omissions include 
projects on mathematical physics, mathematics education and mathematical 
statistics. However it is not the topic that is the emphasis but the method of 
doing the project, its order and level. Before starting the project, how do you 
decide on a topic? 



3.2 Selecting a Project 

How do you select a project? What are the pros and cons? The student who 
knocks on the door of the lecturer and says “I want to do a project” needs 
to be given advice. First, is the project compulsory? If it is not, you will find 
yourself being questioned as to how you came to the conclusion that the project 
is better than the alternatives on offer. Having decided on doing the project, 
do you know what subject? A common first step is to provide all prospective 
students with a list of staff and their interests. Such a list is given here as 
Table 3.1. Of course, you can come up with your own project topic, but this has 
to be approved by a member of academic staff. Some universities do not allow 
this and have a prescribed list of projects from which to choose. In one sense 
this is a good idea as the university is taking all the responsibility and if the 
project falls over for any reason, the university is in a better position to put 
things right. The student who is permitted to embark on a rather ill-thought- 
out project on their pet topic, for example “the mathematics of sailing”, can be 
left all at sea! 

Table 3.1 Staff Interests 

Name                        Areas of Interest 

Professor Sir Kit Breaker   Electrodynamics 
Professor Cuthbert Calculus Rockets and Astrophysics 
Dr. Derek Delta             Quantum Mechanics, Generalised Functions 
Professor Phil Dyke         Dynamical Oceanography and Meteorology, 
                            Mechanics, Special Functions 
Dr. Eugene Function         Boundary Value Problems 
Professor Polly Hedron      Geometry 
Dr. Esther Mator            Mathematical Statistics 
Dr. Archie Meadies          History of Mathematics 
Dr. Dai Namics              Mechanics 
Professor Ternut Onanov     Boolean Algebra, Discrete Mathematics 
Professor Carol Orry        Real and Complex Analysis 
Professor Sam Poole         Applied Statistics 
Dr. Victor Product          Mathematical Methods 
Dr. Algy Rithem             Numerical Methods 
Dr. Constance See           Relativity, Mathematical Physics 
Dr. Ena Sher                Mechanics 
Dr. Ron Skion               Differential Equations 
Dr. Fay Splane              Dynamic Systems 
Dr. Cary Steristics         Partial Differential Equations 
Professor Harry Stottle     Foundations of Mathematics, Philosophy 
Dr. Con Straint             Optimisation 
Dr. May Tricks              Linear Algebra, Systems of Equations 
Professor Eddy Viscosity    Fluid Dynamics, Turbulence 

It is important that the scope is defined and specific tasks are assigned 
for the student to complete. Personally, I always interrogate the student armed 
with their performance to date to find out what kind of mathematics they enjoy 
and what kind of mathematics they are good at. If these are the same, then 
no problems. The projects supervised by me usually involve my own interests 
in oceanography and meteorology. This is useful for the prospective project 
student as a topic that is the research interest of the lecturer provides the 
student with a plethora of background material, good advice and enthusiasm. 
I was once in the position of “bailing out” a member of staff and supervising a 
student on a topic in which I had very little interest; it was not a success and 
I would not recommend it. The list of names provided in Table 3.1 is not (with 
one notable exception) real of course, but their interests are quite typical. This 
mythical mathematics department has several members of staff interested in 
aspects of differential equations which probably reflects a research strength; 
on the other hand nobody professes an interest in graphs and networks or 
the applied side of operational research. The student must be prepared for 
this. It is now becoming the norm for departments to specialise, and small 
departments in particular will not be able to cover the whole of mathematics 
at the depth required for a final year project. Departments have their strengths 
and weaknesses and if your particular interest is not catered for you must make 
a choice between good supervision of a less than ideal topic or a more hands-off 
supervision of a favoured topic. All the academic staff in the department can do 
is make sure that you are aware of the situation and to proffer advice. Another 
choice is whether to go for a project subject that is highly mathematical (say 
something on quark confinement in modern quantum mechanics) which might 
be interesting but difficult, or something less mathematical (say a biography 
of Leonhard Euler). Often the advice is that the less mathematical project is 
safer, that is it is likely to be graded in the middle (lower second). Therefore if 
you are striving for a first, go for a hard project that might get a first class mark 
and help you achieve your ambition. On the other hand, if you are struggling 
to keep up or finding the technicalities of several of the modules taxing, then 
go for the mathematically less demanding project at which you are less likely 
to founder. I think that this is good advice. With greater access to higher 
education a priority with many governments, notably here in the UK, it is 
important for mathematics degrees to cater for a wider ability range than they 
did twenty or thirty years ago. 

Before looking at some examples of projects, the next section is devoted to 
the art of report writing, including the interim report. 



3.3 Report Writing 

Report writing is an art everyone would do well to master. There certainly 
will be guidelines on how to write a report in the student handbook which 
is usually given to each one of you during induction week. If this is not the 
case there will be guidelines issued either on paper to each student or (less 
satisfactorily given student hardship and the cost of printing) on the university 
or departmental website. In this section the opportunity will be taken to give 
some general guidance. 

It is common for many lecturers to forget that the project report will probably 
be the first lengthy scientifically based piece of work a student has ever 
produced. This being so, although some of what follows may seem obvious to 
the readers of this text who are academics, in my experience students will find 
it useful. 

In this chapter we concentrate mainly on the production of the individual 
project report, but most of what follows is general enough to apply to all 
substantial pieces of student writing. The project usually gets going in early 
autumn (say October) and has to be completed by early May. There will be 
departures from this timetable which happens to be the one that is used at 
the University of Plymouth, but the departures will not be significant except 
in one case. In some universities an individual project will only extend over 
half an academic year, either between October and January, or February and 
June. If this is the case, the general principles remain the same except that all 
has to be done over a shorter time scale. The supervisor and student should 
meet weekly. In recent years this meeting has often had to be short, say half 
an hour, due to resource constraints; however it is the frequency that is much 
more important than its length. At an early meeting, the project supervisor 
should give advice on writing the report and map out the writing timetable 
for the student. You also need to come to grips with the “interim report” if 
it exists. The temptation to spend all contact time discussing the technicalities of
the project should be resisted. In conversations with their tutors and lecturers 
and in written guidelines, students will be told to make good use of their 
supervisor. Early on, you will be faced with the first point at which some kind 
of assessment might be expected. This is usually the interim report. If this is 
as soon as two months into the project, then it is there to act as a mark at 
which you can spell out what you expect to do in your project in some detail. If 
some mathematics is required, perhaps to give the assessors some indication of 
the mathematical level, then concentrate on solving one specific problem and 
write up the solution. What emerges is a piece of work that both helps you to 
structure the project and assess how hard the mathematics is likely to be, and 
helps the assessors get a better idea of what might be coming. If the interim 
report is expected later, say after Christmas, then it is really a progress report. 
In it you should give a short account of the work done so far and what you 
expect to complete before the hand-in date. This interim report provides a rich 
source for any oral presentation that might also accompany it. Returning to 
the final report, if this is produced entirely without input from the supervisor 
it is never the best and is often a disaster. You should give yourself plenty of 
time to write the final report. A common error is to use all available time to 
concentrate on the technical material. It is always tempting to go just that little 
bit further mathematically but then the unfortunate student is left rushing the 
write up as the submission deadline suddenly looms. Time is needed for the 
supervisor to read the entire submission and to give constructive feedback on
aspects from minutiae such as sentence construction, to the overall structure
and layout of the report. There also needs to be plenty of time for proof-reading 
and detection and correction of typing and other perhaps more serious errors. 

You need to understand what you have written. It is all too easy to suc- 
cumb to the temptation to copy mathematics, a proof or derivation, without 
fully understanding it. There will be some kind of oral assessment, so students 
doing this are adopting a high risk strategy. It should be emphasised to all 
students that they need to comprehend all the mathematics that they put in 
their project. Any reproduced derivation or proof must therefore be fully
understood as well as fully referenced. It is the view of this author that mathematics
projects are best produced in LaTeX. However, most students will already have
experience using Microsoft Word from school or college and it is quite adequate
for their needs. Moreover, LaTeX can be difficult to master quickly especially
if you are not a programmer and are not used to typing that is not WYSIWYG
(What You See Is What You Get). The treatment of mathematics by
LaTeX in terms of storage, layout and flexibility is superior as it is a
specifically designed mathematics typesetting package. (This book is being produced,
needless to say, using LaTeX.) An added advantage is that certain computer
algebra packages such as MAPLE interface with LaTeX successfully. There is no
natural interaction with Microsoft Word. However if another Microsoft product
such as Excel is used for tables or graphs then using Word works quite
well. You must make your own choice: assessors should be concerned with the 
mathematics not the typesetting, although a good layout must be a plus point. 

Students are well advised to produce a skeleton report with chapters and 
section headings before writing anything substantial. You should then give this 
to the supervisor who will provide valuable feedback. It is also a good idea at 
an early stage to write a section or two and get feedback on writing style as well 
as content. The overall style of the report should be for the intelligent layman 
who can understand the mathematics. There are plenty of spell checkers and 
grammar checkers on word processing software these days, but students need 
to be aware of their limitations. Some of the most common words are absent 
from the electronic spell checker and the grammar advice can vary from the 
bizarre to the plain wrong. Students from the UK also need to be aware that 
most of these electronic aids to writing are based on American English. Using 
a dictionary and a thesaurus therefore remains sound advice. The first part 
of the project to write up in any detail is the part with which the student is 
most comfortable. This usually is part of chapter two or three, the “guts” of 
the project. The introduction is the last part to write. 

References are important in a project, and it is important to get both the 
content and the layout right. LaTeX has a good referencing facility (BibTeX)
that can help but may not be easy for the undergraduate to master quickly
enough. Books should be referred to in the following style:

Dyke P.P.G. (2000) An Introduction to Laplace Transforms and Fourier Series
Springer-Verlag, London Berlin Heidelberg 250pp.

Papers on the other hand should be referenced as follows: 

Dyke P.P.G. (1980) On the Stokes’ drift induced by tidal motions in a wide 
estuary Est. Coast. Mar. Sci. 11 pp 17 - 25. 

Sometimes there is an overall page restriction introduced. This is there to pre- 
vent students producing a project report the length of which would be more 
typical of a PhD thesis. Such over-long reports are difficult to assess fairly, 
they take too much student time to produce, often to the detriment of their 
other work, and they tend to be undisciplined. Forcing the student to produce 
a succinct project report is a good strategy. Fifty pages for a project worth one 
sixth of a year is a good guide. 

In the next sections, we introduce the first of the specific projects. They are 
outlined in some detail and comments made. Remember, the purpose is to give 
examples of how projects run, not to introduce new mathematics. Therefore 
if the mathematical content of a particular project is obscure, no matter. You 
will still be able to get information about project methodology (though not as 
much as those who understand everything). In all the examples given below, 
all commentary is mine. There is a portion of quoted text, usually from the 
project introduction, and the mathematics is quoted directly from the student 
project, but without the misprints. The project details are given in reported 
speech. So the phrases “the student wrote....” and “in the project, the student 
gives the following proof....” will be found. This is to make it clear what is 
actually in the project and to distinguish it from something this author would 
like to have seen there but was missing. It is hoped that this style does not 
detract from the object of this chapter which is to be of use to students actually 
undertaking and writing up individual projects. The first project is undeniably 
pure mathematics. 



3.4 Non-Euclidean Geometry 

This first example is a project that is concerned with the geometries developed 
in the early nineteenth century. The level of mathematics may not be thought 
of as being particularly high, but much of it will be new to the student and that
is the point. It is important that this is so of course as the regurgitation of notes
from another module is unacceptable. In large departments the detection of this
could be a real problem. To maximise the detection, wide publicity must be
given by placing lists of project titles, students’ names and their supervisors on
noticeboards or bulletin boards on the local intranet. The project is a vehicle
for the student to explore an avenue of mathematics on his or her own; to 
read new mathematics, to understand theorems and corollaries to them, all 
independent of other students. As a bonus, at job interviews it can be a great
advantage to be able to wax lyrical about your project.

In the 1960s, all teenagers at third form level (aged 14 or 15, now called year 
9 or Key Stage 3 in the UK) fought their way through the theorems of Euclid. 
Those that became undergraduate mathematicians were thus very well versed 
in the geometry of triangles and circles and, more importantly could prove 
results using the axioms that originated from the Elements of Euclid. The idea 
of proposing axioms and using them in a logical way to deduce theorems and 
corollaries was therefore met at this important formative age. However since 
the demise of the popularity of Euclid and his geometry, such methodology has 
all but vanished. The first proofs students see, if they see any, are algebraic and
introduced in the lower sixth. (Proofs by induction and contradiction are the
usual first examples.) Logical thinking thus takes a back seat these days, and
geometrical thinking features hardly at all. Potentially the student who took 
on this project therefore had to learn about geometric proofs perhaps for the 
first time. Actually, being of mature years this student had the advantage of a 
1960s schooling but it lay deep in the memory banks. 

Even though there does need to be an overall minimum standard that has 
to be achieved it is always important to be mindful of the background of the 
student when reading or supervising a project. It needs to be remembered 
therefore that mathematics that was met at school by the lecturer could be 
new to the student and hence be a legitimate vehicle for an undergraduate 
student project. It is time now to look at the mathematics in some detail. 
Of course the object here is not to explore non-Euclidean geometry as in a 
textbook, but to see at first hand some of the processes of producing a project 
using non-Euclidean geometry as a vehicle. 



3.4.1 Scope 

One of the most important aspects of an individual project to decide at the 
outset is its scope. The enthusiastic student is often over-ambitious and if allowed
free rein the rest of his or her course will suffer! One way to prevent
this is to limit pagination as already mentioned, but this is sometimes insufficient.
The advice and guidance of the tutor or project supervisor is essential
at an early stage. This topic, non-Euclidean geometry, is not new, thus advice
is vital. For a newer topic which is more narrowly defined (the use of prime
numbers for encryption for example) guidance is easier as the subject tends
to be more closely defined. In older mathematics there has been time for the
subject to mature and grow and for links between it and other mathematics to
be forged. The student therefore is in need of more guidance. For this project,
the following objectives would have been appropriate:

• to understand the axiomatic approach to geometry 

• to outline the principles of projective geometry 

• to understand the Klein view of geometry. 

Armed with this kind of minimal guidance, the student has less excuse for 
straying too widely. However (s)he is now given some references and told to go 
away, read and explore. The student must also be encouraged to seek advice 
quickly if there is anything they do not understand. 

Quite typically but not essentially for a final year project, the reference 
material consists of textbooks, e.g. H.S.M. Coxeter’s “Projective Geometry”
published in 1987 by Springer-Verlag. This reinforces the view that the material
is indeed suitable undergraduate level mathematics; however it is not part 
of lectures. As we have already said if it is all entirely lecture material then 
it is unsuitable as a project topic. However, a project that extends a final 
stage module is perfectly acceptable and is in fact a good idea. Non-Euclidean 
geometry may be a topic in final stage undergraduate mathematics in a few 
universities but it is only given a mention in passing at Plymouth, the source 
of this and all subsequent project examples. 



3.4.2 Project Details 

Most projects need to set the mathematics involved in some kind of context. 
Particularly in a pure mathematics project a certain amount of history of math- 
ematics becomes necessary. Here is an excerpt from the introduction of this 
project which shows the student’s attempt at doing this: 

From Babylonian times some four thousand years ago, there is evidence of the 
study and use of geometry. In those times the geometry used was practical, 
for working out areas of fields, volumes of grain and so forth. The formulation 
of the mathematical problems was rhetorical and procedures used without 
proof. The move from procedure to proof was instigated by the Greeks who 
used the axiomatic approach to formalise their geometry. Records of this
(circa 300 BC) have been preserved in Euclid’s Elements, thirteen volumes
of Greek mathematical thought. From this time to the beginning of the 
eighteenth century little changed in the basic ideas of geometric theory. There 
had been, during that thousand year gap, controversy over Euclid’s fifth 
parallel postulate with many mathematicians working to find a proof of this 
postulate. In the sixteenth century some progress was made in projective 
geometry prompted by the artistic attempts to represent realistically three 
dimensional space in two dimensions. But it was not until the eighteenth 
century that significant changes appeared in geometrical thinking. 

The student then goes on to discuss the axioms of Euclid as they were criti- 
cised by mathematicians of the eighteenth century, particularly Felix Klein. Of 
course, the phrase “non-Euclidean geometry” needs to be defined. From this 
it follows that it is necessary to understand something about axiomatic math- 
ematics. There is therefore an introductory chapter on axiomatic methods. It 
has already been mentioned that the student is unlikely to be very familiar 
with this kind of mathematics. Historically the “guts” of non-Euclidean geom- 
etry focuses on developments in the early nineteenth century; therefore some 
history of events and biographical details of Lobachewsky and Gauss might be 
expected. The work of Felix Klein is crucial, especially his linking of geome- 
try and group theory. This work dating from 1872 may be expected to feature 
prominently. There is also David Hilbert’s extension (correction?) to the Euclid 
postulates. It is well to contrast this historical material with projects in applied 
mathematics where setting the context of the project in the real world tends 
to be more important. This will be seen in later examples in this chapter. 

Following the setting of the historical perspective, the student needs to do 
some mathematics. Sometimes it is necessary to remind students that a mathematics
project needs to contain mathematics! Even those principally concerned
with history (a biography of Euler or Gauss for example) need to convey the
mathematics of the time. This point was alluded to in Chapter 2, and will
emerge again. This project states axioms as follows:

• For every point P and for every point Q, P not equal to Q, there exists a
unique line l that passes through P and Q.

• For every line l there exist two distinct points incident with l.

Such axioms with their formality of language can seem very strange to many 
students. As a common aim of a project is to be able to present new mathe- 
matics (new to the students that is) to fellow students it is quite appropriate 
to select a topic for a short presentation to the class. An introduction to this 
kind of formal mathematics is one possible topic for this talk arising from this 
project. This presentation is often the first formal hurdle the project student
faces. Choosing as a vehicle a topic covered early in the project is thus a good
idea. 

Once the general scene is set, one possible route for this project to take was 
to concentrate on, say, projective geometry. One distinctive aspect of projective 
geometry is the lack of obvious definition of distance. Another is the presence 
of pure geometric theorems. Students appreciate being able to try out new 
mathematics in calculations. In projective geometry homogeneous co-ordinates 
are usefully employed to prove results. These results can be proved using other 
means open to the student, for example vector algebra, but homogeneous co- 
ordinates are particularly neat. At this point we shall delve into the specific 
mathematical details of the project. Homogeneous co-ordinates are defined as
follows. If we have plane Cartesian co-ordinates (X, Y) then a typical straight
line takes the form of the linear equation

\[ aX + bY + c = 0. \]

By putting

\[ X = \frac{x}{z} \quad\text{and}\quad Y = \frac{y}{z} \]

the line becomes

\[ ax + by + cz = 0. \]

The triplet (x,y,z) defines homogeneous co-ordinates. Points for which z = 0 
are called “points at infinity” and z = 0 itself is the “line at infinity” . That all 
conics look like ellipses with the “ellipse” being a parabola if it is tangent to the 
line at infinity and a hyperbola if it crosses this line is intriguing to students 
and helps retain interest as well as cement an understanding of this form of 
non-Euclidean geometry. 
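
For readers who like to experiment, the definitions above can be tried out in a few lines of Python. This is only a sketch and is not taken from the student’s project: the function names and the three sample lines are invented for illustration. It assumes the standard fact that, when a line ax + by + cz = 0 is stored as the triple (a, b, c), the point common to two lines is given by the vector cross product of their triples.

```python
from fractions import Fraction


def to_cartesian(point):
    """Recover (X, Y) = (x/z, y/z); impossible for a point at infinity."""
    x, y, z = point
    if z == 0:
        raise ValueError("a point at infinity has no Cartesian co-ordinates")
    return (Fraction(x) / Fraction(z), Fraction(y) / Fraction(z))


def cross(u, v):
    """Cross product of two triples: here, the point lying on two lines."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


# Illustrative lines, stored as coefficient triples (a, b, c).
line1 = (1, -1, 0)   # X - Y = 0
line2 = (1, -1, 2)   # X - Y + 2 = 0, parallel to line1
line3 = (1, 1, -4)   # X + Y - 4 = 0

meet_parallel = cross(line1, line2)   # a triple with z = 0
meet_ordinary = cross(line1, line3)   # an ordinary point

print(meet_parallel)
print(to_cartesian(meet_ordinary))
```

The two parallel lines have no Cartesian point in common, yet in homogeneous co-ordinates they meet at a triple whose last entry is zero, that is, on the line at infinity.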

To give a flavour of one of the geometric proofs, here is a statement and 
proof of Desargues’ theorem as it might appear in this student’s project. 

Theorem 3.1 

Desargues’ Theorem 

If two triangles are in perspective, then the intersections of corresponding
sides are collinear, see Figure 3.1.

Proof Let U be the intersection of the lines of perspective, and let it have
homogeneous co-ordinates $(\alpha, \beta, \gamma)$. This means that if the two triangles are
ABC and A'B'C' then AA', BB' and CC' all pass through U as shown in
Figure 3.1. We can arbitrarily set the points A, B and C to have co-ordinates
(1,0,0), (0,1,0) and (0,0,1) respectively. The vertices of the other triangle
A'B'C' will be of the form $(\alpha+\lambda, \beta, \gamma)$, $(\alpha, \beta+\mu, \gamma)$ and
$(\alpha, \beta, \gamma+\nu)$. The equations of the three sides A'B', B'C' and C'A' are

\[
\begin{vmatrix} x & y & z \\ \alpha+\lambda & \beta & \gamma \\ \alpha & \beta+\mu & \gamma \end{vmatrix} = 0, \qquad
\begin{vmatrix} x & y & z \\ \alpha & \beta+\mu & \gamma \\ \alpha & \beta & \gamma+\nu \end{vmatrix} = 0
\qquad\text{and}\qquad
\begin{vmatrix} x & y & z \\ \alpha & \beta & \gamma+\nu \\ \alpha+\lambda & \beta & \gamma \end{vmatrix} = 0.
\]

Algebraic manipulation is used to show that these lines intersect AB ($z = 0$),
BC ($x = 0$) and CA ($y = 0$) on the line

\[ \frac{x}{\lambda} + \frac{y}{\mu} + \frac{z}{\nu} = 0. \]

Thus we have found in algebraic form the actual line that contains the three
points of intersection of AC and A'C', AB and A'B', and BC and B'C'. Hence
Desargues’ theorem is established.

□ 
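
The phrase “algebraic manipulation” hides a fair amount of work, and a student unsure of the result may find a numerical check reassuring. The following Python sketch is not part of the project; the numerical values chosen for α, β, γ, λ, μ and ν are arbitrary and the helper names are invented here. It uses exact rational arithmetic and assumes the standard fact that the cross product of two triples gives both the line joining two points and the point common to two lines.

```python
from fractions import Fraction


def cross(u, v):
    """Cross product of two triples: joins two points or meets two lines."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def incident(line, point):
    """True when the point lies on the line (the dot product vanishes)."""
    return sum(a * x for a, x in zip(line, point)) == 0


# Arbitrary illustrative values, chosen so that nothing degenerates.
alpha, beta, gamma = Fraction(2), Fraction(3), Fraction(5)
lam, mu, nu = Fraction(7), Fraction(11), Fraction(13)

U = (alpha, beta, gamma)
A, B, C = (1, 0, 0), (0, 1, 0), (0, 0, 1)
A1 = (alpha + lam, beta, gamma)   # A'
B1 = (alpha, beta + mu, gamma)    # B'
C1 = (alpha, beta, gamma + nu)    # C'

# The triangles are in perspective: AA', BB' and CC' all pass through U.
in_perspective = all(incident(cross(V, V1), U)
                     for V, V1 in ((A, A1), (B, B1), (C, C1)))

# Intersections of corresponding sides.
P_ab = cross(cross(A, B), cross(A1, B1))
P_bc = cross(cross(B, C), cross(B1, C1))
P_ca = cross(cross(C, A), cross(C1, A1))

# The line x/lam + y/mu + z/nu = 0 claimed in the proof.
axis = (1 / lam, 1 / mu, 1 / nu)
all_on_axis = all(incident(axis, P) for P in (P_ab, P_bc, P_ca))

print(in_perspective, all_on_axis)
```

A check of this kind proves nothing, of course, but it does catch slips in the algebra before the write up.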



This proof demonstrates the power of linear algebra to prove geometric the- 
orems. This proof is devoid of error and its appearance exactly like this in a 
project might lead one to suspect that it had been copied verbatim from a 
textbook. This is not a problem provided the student understands what (s)he 
has copied and this can be assessed via an oral examination. The student must 
be able to understand for example why it is possible to assign the points A, B 
and C the rather special looking homogeneous co-ordinates. Any student who 
was not confident would not be sure this was a correct step and the proof would 
look “fiddled”. The algebraic manipulation used to achieve the final result is 
not trivial either and a careless student would soon get in a mess. Pure geom- 
etry arguments could of course be used, but most students find the algebraic 
approach easier to follow. These are just a few learning points; I am sure you 
can spot others. 

Let us move on to discuss other technical points. An important quantity in 
projective geometry is the cross-ratio which is defined by the student as follows 



\[ (A,B,C,D) = \frac{AC}{CB} \div \frac{AD}{DB}. \]



The student points out that only the ratio of lengths is required in this defi- 
nition and not the lengths themselves. Defining a quantity that is new to the 
student then examining some of its properties makes a good assignment in a 
student project. Here is the proof of the invariance of cross-ratio under a 1-1
correspondence between points as it appears in the project. It is simple algebra
but usefully gets the student familiar with the cross-ratio. 



Theorem 3.2 

Let $(x_1, y_1)$, $(x_2, y_2)$, $(x_3, y_3)$ and $(x_4, y_4)$ be four corresponding pairs of a given
(1,1) correspondence. Then

\[ (x_1, x_2, x_3, x_4) = (y_1, y_2, y_3, y_4). \]

Proof Suppose that the equation of the correspondence is

\[ axy + bx + cy + d = 0. \]

Then

\[ a x_1 y_1 + b x_1 + c y_1 + d = 0 \quad\text{and}\quad a x_3 y_3 + b x_3 + c y_3 + d = 0. \]

Subtracting these two equations and rearranging gives

\[ b(x_1 - x_3) + c(y_1 - y_3) = -a(x_1 y_1 - x_3 y_3) = -a x_1 (y_1 - y_3) - a y_3 (x_1 - x_3). \]

This gives

\[ \frac{x_1 - x_3}{y_1 - y_3} = -\frac{a x_1 + c}{a y_3 + b}. \]

Similarly we derive

\[ \frac{x_1 - x_4}{y_1 - y_4} = -\frac{a x_1 + c}{a y_4 + b}, \qquad
   \frac{x_2 - x_3}{y_2 - y_3} = -\frac{a x_2 + c}{a y_3 + b} \]

and

\[ \frac{x_2 - x_4}{y_2 - y_4} = -\frac{a x_2 + c}{a y_4 + b}. \]

Judicious dividing of these equations to eliminate the constants a, b, and c gives

\[ \frac{x_1 - x_3}{x_1 - x_4} \bigg/ \frac{x_2 - x_3}{x_2 - x_4}
   = \frac{y_1 - y_3}{y_1 - y_4} \bigg/ \frac{y_2 - y_3}{y_2 - y_4} \]

which is the desired result.



□ 
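
The theorem lends itself to a quick numerical experiment. The sketch below is not from the project: the constants a, b, c, d and the four sample values of x are arbitrary choices, and the function names are invented for illustration. It solves axy + bx + cy + d = 0 for y to obtain the corresponding points and compares the two cross-ratios using exact fractions.

```python
from fractions import Fraction


def cross_ratio(p1, p2, p3, p4):
    """Cross-ratio in the form ((p1-p3)/(p1-p4)) / ((p2-p3)/(p2-p4))."""
    return ((p1 - p3) / (p1 - p4)) / ((p2 - p3) / (p2 - p4))


def correspond(x, a, b, c, d):
    """Solve a*x*y + b*x + c*y + d = 0 for y (requires a*x + c != 0)."""
    return -(b * x + d) / (a * x + c)


# Arbitrary illustrative constants; b*c - a*d is non-zero so the map is 1-1.
a, b, c, d = Fraction(2), Fraction(-1), Fraction(3), Fraction(5)

xs = [Fraction(0), Fraction(1), Fraction(2), Fraction(4)]
ys = [correspond(x, a, b, c, d) for x in xs]

ratio_x = cross_ratio(*xs)
ratio_y = cross_ratio(*ys)

print(ratio_x, ratio_y)
```

Changing the constants or the sample points and rerunning is a painless way for a student to become convinced that the equality is no accident.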



Of course, pure geometric proofs may also feature. For example here is Pappus’ 
Theorem: 



Theorem 3.3 (Pappus) 

Let P, Q and R, and L, M and N be two sets of three collinear points. Let 
QN, RM meet at the point F, let RL, PN meet at the point G and let PM, 
QL meet at the point H. Then the points F, G and H are collinear.



Proof 1 This is the standard algebraic proof (absent from this project, but
given here for information) in outline. Let the six points have the following
co-ordinates:

\[ P = (p,0,1), \quad Q = (q,0,1), \quad R = (r,0,1), \]
\[ L = (l,1,0), \quad M = (m,1,0), \quad N = (n,1,0). \]

The equation of the line QN is the determinant:

\[ \begin{vmatrix} x & y & z \\ q & 0 & 1 \\ n & 1 & 0 \end{vmatrix} = 0 \]

or

\[ -x + ny + qz = 0. \]

The line RM has the equation

\[ -x + my + rz = 0 \]

which is found similarly. The intersection of these two lines is found by solving
the simultaneous equations as

\[ F = (mq - nr,\; q - r,\; m - n). \]

Similarly

\[ G = (nr - lp,\; r - p,\; n - l) \quad\text{and}\quad H = (lp - mq,\; p - q,\; l - m). \]

The determinant formed by these right hand sides:

\[ \begin{vmatrix} mq - nr & q - r & m - n \\ nr - lp & r - p & n - l \\ lp - mq & p - q & l - m \end{vmatrix} \]

is zero as is apparent by adding the rows. This shows collinearity and establishes
Pappus’ Theorem.

□ 
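
Since this proof is pure handle turning, it is easily mechanised. The Python sketch below is offered as an illustration only and did not appear in the project; the values given to p, q, r, l, m and n are arbitrary and the helper names are invented. It builds F, G and H from cross products (joining two points and meeting two lines both reduce to a cross product of triples) and then evaluates the determinant.

```python
def cross(u, v):
    """Cross product of two triples: joins two points or meets two lines."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def det3(r1, r2, r3):
    """Determinant of the 3 x 3 matrix with rows r1, r2, r3."""
    return (r1[0] * (r2[1] * r3[2] - r2[2] * r3[1])
            - r1[1] * (r2[0] * r3[2] - r2[2] * r3[0])
            + r1[2] * (r2[0] * r3[1] - r2[1] * r3[0]))


# Arbitrary illustrative values for the six parameters.
p, q, r = 1, 2, 4
l, m, n = 3, 5, 6

P, Q, R = (p, 0, 1), (q, 0, 1), (r, 0, 1)
L, M, N = (l, 1, 0), (m, 1, 0), (n, 1, 0)

F = cross(cross(Q, N), cross(R, M))   # QN meets RM
G = cross(cross(R, L), cross(P, N))   # RL meets PN
H = cross(cross(P, M), cross(Q, L))   # PM meets QL

# The formula quoted in the proof; F may differ from it by a common factor,
# which is immaterial in homogeneous co-ordinates.
F_formula = (m * q - n * r, q - r, m - n)

collinear = det3(F, G, H) == 0
print(F, G, H, collinear)
```

Two triples represent the same point exactly when their cross product is the zero triple, which is how the computed F can be compared with the formula in the proof.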

The handle turning nature of this proof meets with favour from the student, 
but tends to leave the tutor cold. Lecturers prefer geometric proofs that give 
more insight. Here is one for Pappus’ Theorem that resembles the proof that 
actually appeared in the project report: 

Proof 2 In this proof, co-ordinates are not used. Instead we label the 
intersections of the lines PM and RL as A and PN and RM as B. We then rely 
on theorems involving projectivity (which may not be familiar to everyone!). 
A projectivity is a 1-1 correspondence between points P and Q such that
each P gives rise to a unique Q and vice versa. It is sometimes also called
a homography. The line PHAM is in projectivity with the line PQRO (O being
the point where the lines PQR and LMN meet) with
respect to the point L. Moving our attention to the point N, PQRO is in turn
in projectivity with the line BFRM. Therefore the lines PHAM and BFRM 
are in projectivity, and a theorem can be used that tells us that provided the 
lines have a point that corresponds to itself (in our case the point M) we have 
a projectivity and a perspectivity which implies that the lines PB, HF and 
AR are concurrent. This implies that G lies on HF being the intersection of 
all three lines and Pappus’ theorem is once again established. (See Figure 3.2.) 

□

Which proof is preferred is a matter of taste. The geometric proof demands
a specialist background but wins in elegance. The algebraic proof has the
advantage of being easier to follow in common with most “methods” proofs. For
the student of mathematics, as we have said there is much to recommend the 
geometric proof and it was probably at the instigation of the project supervisor 
that this version appears in the project that provided the source material for 
this section. This kind of geometric thinking is likely to be very new to most 
students but very beneficial to their mathematical understanding. 

There are many other fundamental concepts that might find their way into 
a project on this topic. For example this student introduced the Principle of
Duality whereby in any theorem in plane geometry one simply swaps “point” for
“line” and vice versa, and substitutes “intersection of two lines” for “line
joining two points” and vice versa. One can see for example that the dual of Desargues’
theorem is the same as its converse. There are other cases where the application
of the Principle of Duality gives rise to distinctly different theorems. There is 
the whole question of conics which as we have said in projective geometry 
all look like ellipses, and their relationship with triangles. For example some 
readers may be familiar with the nine-points circle which in projective geometry
becomes the eleven-points conic. Suffice it to say that projective geometry is
a very rich subject and many final year projects could be based upon it. If 
there is a projective geometry module (or two!) somewhere else in the course 
so much the better as the project can build on the taught material and be 
much richer as a result. Note that there is not much mention here of the third 
bullet point under “scope” concerning the Klein view of geometry. As a project 
progresses it is not unusual for the emphasis to alter and in this case it moved 
into projective geometry and away from the work of Klein. As long as tutor 
and student agree this is not a problem. This mirrors the business world in 
which although strategic plans for the next ten years are drawn up, typically 
they change inside three. 

The assessors for this project were impressed by the mathematics in the 
project, and by the performance of the student at the presentation and at the 
oral examination. The layout of the project and the writing style were excellent.
It was felt that the student could have done more on the Klein view of geometry, 
but the consensus final mark was that the project deserved first class honours, 
but near the borderline. 

Some students would rather tackle a project on a subject that relates more 
to the world outside the classroom. The next project is certainly one of these. 



3.5 Boomerangs 

The boomerang is the Australian weapon that if thrown correctly moves in a 
(large) circle, or slightly distorted circle, and lands at the foot of the thrower 
unless interrupted by hitting the target (a kangaroo perhaps). So if the stu- 
dent wants to find out how a boomerang works, the supervisor needs to tell 
the student about the kind of mathematical background required. This partic- 
ular project is very difficult to do within mathematics courses that contain no 
mechanics, although not impossible. It is all too easy for a specifically defined
project such as this to end up as an introduction to rigid body mechanics,
particularly as in Plymouth, where this project was set, the course at the time 
only contained particle mechanics. This provides a less than ideal background. 
If some knowledge of rigid body dynamics, particularly the spinning body, had 
been met before then so much of this project need not have been spent on the 
fundamentals of rigid body mechanics. 

The role of applied mathematics in school mathematics has certainly 
changed a lot in the past thirty years. Mechanics used to be a substantial 
and compulsory part of the mathematical diet of the 16 to 18 year old. The 
two thick books by Humphrey and Topping called “Statics” and “Dynamics”
and now long out of print covered the material. Old well thumbed copies may
still be found in libraries, but more likely they have been sold off cheaply, given 
away or even thrown out. In any case the material in old ‘A’ level mechanics 
texts is almost all inaccessible to today’s students due to the lack of geome- 
try already alluded to. It is perfectly possible for the 16 year old to embark 
on a mathematics course, ‘A’ level or its equivalent, that contains no physi- 
cal applied mathematics. Engineers think this sacrilegious, but it is perfectly 
reasonable if one considers the destiny of students of ‘A’ level mathematics 
these days. Business, finance and computing soak up most
students and these need statistics and decision mathematics much more than
knowledge of Newton’s laws. There are also the social sciences, biology and 
psychology, all of which have become more quantitative and require knowledge 
of statistics. Where better to put this statistics than in ‘A’ level mathematics? 
Of course there is still a place for traditional applied mathematics, but it is 
no longer seen as needing to be compulsory. This then has changed the scope 
of mechanics in mathematics degrees. Some universities can still deliver mod- 
ules in Mechanics not a great deal different in content to those of the 1960s 
(apart from computer based material of course). Students from these are well 
equipped to tackle a project on boomerangs and such universities are large 
and usually have substantial engineering and/or physics departments where 
mechanics is also core and provides an environment which is attractive to the 
academic applied mathematician. When the project started, it soon became 
apparent that it would be a good idea to incorporate frisbees too. Frisbees are 
discs which, being symmetric and geometrically simpler than boomerangs are 
easier to analyse. 



3.5.1 Scope 

In applied projects it is very rare to feature any history. It is hardly appropriate 
to give a history of the boomerang. To give a brief historical development of the 
mechanics and fluid mechanics would be straying too far from the central theme 
of the project which is not mechanics or fluid mechanics but an application of 
them. It is of course necessary to have introductory material, but this takes the 
following different form: 

We tackle the analysis of rotational motion with reference to the frisbee and 
boomerang. Since the swept circle of the boomerang resembles a frisbee we 
can base most of our theory on the frisbee then relate the results back to 
the boomerang. There will obviously be some difference in the motion of the 
two projectiles and this will be explained in Chapter 6. In what follows we 
use aerofoil theory to account for the frisbee’s ability to hover. For this we
start with irrotational flow past a circular cylinder, using Milne-Thomson’s
Circle Theorem to obtain the complex potential. We then prove that there
is a force $-\rho u \Gamma$ in the y-direction. If we can find a shape for which $\Gamma < 0$
then we can make this force act in the positive y direction. In pursuing this
shape we use the Joukowski transform to turn the cylinder into an ellipse,
then a flat plate before finally settling on a wing shape with a sharp trailing
edge. 

So what we have here is not history but a genuine introduction to the 
subject matter of the project. The language used reflects the inexperience of 
the student which of course is perfectly acceptable. The material emphasises 
the fluid mechanics side of the work which most excited this particular student. 
However given the foregoing remarks about mechanics, this project may also 
contain an introduction to rigid body mechanics. First of all, a rigid body is 
defined. The special place of plane rotation then leads to considering rotating 
bodies which naturally leads to defining the moment of inertia. The description 
then moves on to defining torque as the moment of force and stating that it 
is equal to the rate of change of angular momentum, and then progressing to 
a discussion of three dimensional rotation and gyroscopes. Only those points 
relevant to spinning boomerangs are covered, but the knowledgeable reader is 
at once aware of the pace of learning here. For some less mechanically inclined 
students this would need to be quick and focused. One essential difference 
between a taught module and a project is that a project is focused and specific 
whereas a module (on mechanics say) needs to provide reasonable coverage of 
the subject and thus must be broader. To cover all of mechanics from defining 
the rigid body to Euler angles and gyroscopes is a full module, but to do that 
which is necessary for a project on boomerangs can be considerably less. For the 
project student here, it was about 30% of the contents of a standard mechanics 
module. The project is principally about modelling a flying aerodynamic body 
and thus must contain some fluid dynamics. In particular enough background 
needs to be covered to understand lift. If the student knows complex variable 
theory then the Joukowski transformation becomes an application of (known) 
two dimensional inviscid flow theory. Otherwise, some fluid dynamics of an 
elementary nature needs to appear in the project. As indicated in the excerpt 
from the introduction, the choice of this particular student is not to include 
elementary two dimensional flow in the project. In fact there is barely enough 
for the reader (and writer!) to understand the complex potential, and this is 
a criticism. Reading the above it is obvious that the student here did know 
a little fluid mechanics and complex variable theory, but some revision was 
required. Here then are the aims of the project as agreed between supervisor 
and student: 

• to outline the theory of rigid body mechanics in enough detail to understand
how a flying boomerang (or frisbee) remains stable in flight,

• to understand aerodynamic lift and apply the theory to boomerangs (or
frisbees),

• to understand why the thrown boomerang returns to the thrower.

At the outset, the student was pointed to texts on mechanics and fluid me- 
chanics that covered the topics mentioned above. More specialist articles on 
frisbees and/or boomerangs were found using library searches. For the talk the 
student gave an overview of the project as a whole with some details about 
the Joukowski aerofoil. The amount of mathematics presented was not large, 
but was enough for members of the audience to get the gist. The talk was very
successful, a success almost guaranteed by this particular student’s flair for
presentation.



3.5.2 Project Details 

In this section it is emphasised that the mathematics is quoted directly from the
project. As is quite typical this project gets straight on with the mathematical
detail, so let us do the same. The project student takes the definition of angular
momentum ($\mathbf{h} = \mathbf{r} \times m\mathbf{v}$) and generalises it to $n$ particles

$$\mathbf{h} = \sum_{i=1}^{n} \mathbf{r}_i \times m_i \mathbf{v}_i$$

where $\mathbf{r}_i$, $m_i$ and $\mathbf{v}_i$ are the position vector, mass and velocity of the $i$th
particle respectively. For a rigid body that is rotating with angular velocity $\boldsymbol{\omega}$
we have $\mathbf{v}_i = \boldsymbol{\omega} \times \mathbf{r}_i$. Inserting this into the expression for angular momentum
gives

$$\mathbf{h} = \sum_{i=1}^{n} \mathbf{r}_i \times m_i (\boldsymbol{\omega} \times \mathbf{r}_i).$$

The right hand side of this expression contains a triple vector product which
can be expressed as follows:

$$\mathbf{r}_i \times (\boldsymbol{\omega} \times \mathbf{r}_i) = \boldsymbol{\omega} r_i^2 - \mathbf{r}_i (\boldsymbol{\omega} \cdot \mathbf{r}_i) = \boldsymbol{\omega} r_i^2,$$

as $\boldsymbol{\omega}$ and $\mathbf{r}_i$ are perpendicular. Thus

$$\mathbf{h} = \sum_{i=1}^{n} m_i r_i^2 \boldsymbol{\omega} = \boldsymbol{\omega} \sum_{i=1}^{n} m_i r_i^2 = I\boldsymbol{\omega},$$

where $I$ is the moment of inertia of the rigid body. The usual steps whereby
the finite number of particles become infinitely many infinitesimal particles
were skipped for this project. As we have said, this is acceptable; after all the
emphasis is on describing boomerangs here. A project report is different from a
set of lecture notes: it tells a story, and the story can presume, state without
proof, etc., leaving out what is not essential to the narrative at the author’s
discretion.
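The particle sum above is easy to check numerically. The following is a minimal sketch, assuming a handful of hypothetical particles lying in the plane $z = 0$ and rotating about the $z$-axis (so that $\boldsymbol{\omega}$ is perpendicular to every $\mathbf{r}_i$, as the derivation requires). The masses, positions and function names are illustrative and do not come from the project.

```python
def cross(a, b):
    """Vector product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def angular_momentum(masses, positions, omega):
    """Sum r_i x m_i (omega x r_i) over all particles."""
    h = [0.0, 0.0, 0.0]
    for m, r in zip(masses, positions):
        v = cross(omega, r)          # v_i = omega x r_i
        term = cross(r, v)           # r_i x v_i
        for k in range(3):
            h[k] += m * term[k]
    return tuple(h)

def moment_of_inertia(masses, positions):
    """I = sum m_i r_i^2 for particles in the plane perpendicular to the axis."""
    return sum(m * (r[0] ** 2 + r[1] ** 2 + r[2] ** 2)
               for m, r in zip(masses, positions))

# Hypothetical data: three particles in the plane z = 0, spinning about the z-axis.
masses = [1.0, 2.0, 0.5]
positions = [(1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (-1.0, -1.0, 0.0)]
omega = (0.0, 0.0, 3.0)

h = angular_momentum(masses, positions, omega)
I = moment_of_inertia(masses, positions)
print(h, I * omega[2])   # the z-component of h should equal I * omega
```

If a particle is moved out of the plane, the term $\mathbf{r}_i(\boldsymbol{\omega} \cdot \mathbf{r}_i)$ no longer vanishes and the two printed values differ, which is one way of seeing why the perpendicularity assumption matters.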

For boomerangs (and frisbees) the relationship torque = rate of change of
angular momentum is important. Using the above, this is

$$\mathbf{r} \times \mathbf{F} = I\dot{\boldsymbol{\omega}},$$

where $\mathbf{F}$ is the applied force. To this the student needed to add the analysis of
a spinning object. The route to this is via Euler’s equations for a body moving
freely in three dimensions. Again, this may be part of a lecture course, but
less commonly so these days. Using the above notation Euler’s equations are
quoted in the project in the form:

$$\dot{\mathbf{h}} + \boldsymbol{\omega} \times \mathbf{h} = \mathbf{r} \times \mathbf{F}.$$

Quoting this is fine in one respect, but the student has to understand that 
equations need to be referred to a fixed origin and fixed axes. To derive such 
equations is not trivial and an understanding of Coriolis acceleration and ro- 
tating systems is really an essential requirement. Once this is established, the 
precession of a flying spinning object can be analysed. One route to this is to 
analyse a fixed spinning object first (a top or gyroscope say, in this project the 
student used a top) through the following mathematics. Axes that rotate with 
the body are defined by (i,j,k) where k is the axis of rotation and the other 
two axes (i and j) rotate with the top. Figure 3.3 shows the details. Now, the
bold Greek letter $\boldsymbol{\omega}$ is reserved for the rotation of the system referred to fixed
axes yet to be defined, so the student uses $\boldsymbol{\Omega}$ to denote the angular velocity of
the top about its axis of symmetry. (S)he supposes further that this axis is at
a constant angle $\alpha$ to a fixed vertical axis $\mathbf{K}$. Therefore the student writes

$$\boldsymbol{\omega} = \omega_1 \mathbf{i} + \omega_2 \mathbf{j} + \omega_3 \mathbf{k}$$

and

$$\boldsymbol{\Omega} = \Omega \mathbf{k}.$$



Figure 3.3 A spinning top.

Under gravity and neglecting friction the only external force acting is gravity,
so the external torque is

$$a\mathbf{k} \times (-mg\mathbf{K})$$

where $a$ is the distance of the centre of mass of the top from the point about
which it pivots, usually its contact point with the ground. The two axes i
and j have not been assigned precisely other than that they are in the plane
of rotation of the top. Therefore it is possible to define i as pointing in the
direction of the torque. Thus

$$\mathbf{K} = \sin\alpha\,\mathbf{j} + \cos\alpha\,\mathbf{k}$$

and

$$\boldsymbol{\Gamma} = mga\sin\alpha\,\mathbf{i},$$



where the letter $\boldsymbol{\Gamma}$ is standard notation for torque. Euler’s equations of motion
can then be written:

$$A\dot{\omega}_1 + (C - A)\omega_2\omega_3 + C\omega_2\Omega = mga\sin\alpha,$$
$$A\dot{\omega}_2 + (A - C)\omega_3\omega_1 - C\omega_1\Omega = 0,$$
$$\dot{\omega}_3 + \dot{\Omega} = 0,$$

where $A$ and $C$ denote the moments of inertia of the top about i (or j, it
matters not which due to axisymmetry) and k respectively. By the way the
axis i has been defined, and the assumption that $\alpha$ is constant and in the plane
of j and k, $\omega_1 = 0$. The second of the above equations then gives $\dot{\omega}_2 = 0$ which
implies $\omega_2$ is a constant or that the top precesses steadily. This then provides a
summary as it appears in this project of the background needed to understand
precession which is essential for realising why a boomerang or frisbee flies like
it does.
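To see what steady precession means in numbers, one can go a step beyond what is quoted from the project. The sketch below assumes that the rotating axes turn about the fixed vertical at a constant rate $p$, so that $\omega_1 = 0$, $\omega_2 = p\sin\alpha$ and $\omega_3 = p\cos\alpha$. With $\dot{\omega}_1 = 0$, the first Euler equation then reduces (after dividing by $\sin\alpha$) to the quadratic $(C - A)p^2\cos\alpha + C\Omega p - mga = 0$. This identification, the function name and all numerical values are assumptions made for illustration.

```python
import math

def steady_precession_rates(A, C, spin, m, g, a, alpha):
    """Real roots p of (C - A) p^2 cos(alpha) + C*spin*p - m*g*a = 0, smallest first."""
    qa = (C - A) * math.cos(alpha)
    qb = C * spin
    qc = -m * g * a
    if abs(qa) < 1e-15:              # quadratic term vanishes (A == C or alpha == pi/2)
        return [-qc / qb]
    disc = qb * qb - 4.0 * qa * qc
    if disc < 0.0:
        return []                    # spin too slow for steady precession at this tilt
    root = math.sqrt(disc)
    roots = [(-qb + root) / (2.0 * qa), (-qb - root) / (2.0 * qa)]
    return sorted(roots, key=abs)

# Hypothetical small top: SI units throughout.
A, C = 8.0e-5, 5.0e-5      # moments of inertia about i (or j) and k, kg m^2
spin = 200.0               # Omega, rad/s
m, g, a = 0.1, 9.81, 0.03  # mass, gravity, pivot-to-centre-of-mass distance
alpha = 0.3                # tilt of the axis from the vertical, rad

rates = steady_precession_rates(A, C, spin, m, g, a, alpha)
fast_top_estimate = m * g * a / (C * spin)   # keeps only the C*Omega*p term
print(rates, fast_top_estimate)
```

For a rapidly spinning top the smaller root is close to the familiar estimate $mga/(C\Omega)$, which is why a faster spin gives a slower precession.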

Apart from technical facility with vectorial mechanics, it is important for 
the student to understand the conservation laws of energy and angular momen- 
tum. The above vector algebra indeed shows good understanding. Of course the 
student can expect to be quizzed on this understanding at the oral examination, 
hence copying without understanding remains a high risk strategy to adopt. 
Under the assumption of a constant tilt of the axis of symmetry the preces- 
sion must be steady. But if more general motion is to be understood, these 
conservation laws, in particular the conservation of angular momentum, can 
help to predict the behaviour of spinning flying bodies such as frisbees and 
boomerangs. As the scope of the project indicates, another important topic to 
understand is aerodynamics. 

A number of pages in the project are devoted to the introduction of inviscid 
flow in two dimensions. This leads to the Joukowski transformation which is 
presented in the form

$$Z = z + \frac{c^2}{z}$$

where $Z$ and $z$ are two complex variables. The project student then proceeds
to show that this transformation transforms a circle into an ellipse. Here are
some details from the project. A circle in the $z$-plane of the form

$$z = ae^{i\theta} \quad \text{where } 0 < c < a$$

is considered. This is substituted into the Joukowski transformation to give

$$\begin{aligned}
Z &= ae^{i\theta} + \frac{c^2}{a}e^{-i\theta} \\
  &= a(\cos\theta + i\sin\theta) + \frac{c^2}{a}(\cos\theta - i\sin\theta) \\
  &= \left(a + \frac{c^2}{a}\right)\cos\theta + i\left(a - \frac{c^2}{a}\right)\sin\theta
\end{aligned}$$

which since $Z = X + iY$ gives

$$\frac{X^2}{\left[a + \dfrac{c^2}{a}\right]^2} + \frac{Y^2}{\left[a - \dfrac{c^2}{a}\right]^2} = 1,$$

that is, an ellipse in the $Z$-plane. The inverse of the Joukowski transformation
is

$$z = \tfrac{1}{2}Z + \tfrac{1}{2}\sqrt{Z^2 - 4c^2}$$

and this is substituted into the complex potential for flow ($U$) past a cylinder
with circulation $\kappa$

$$w(z) = U\left(z + \frac{a^2}{z}\right) + \frac{i\kappa}{2\pi}\ln z$$





to give the complex potential for flow past an ellipse with circulation. This par- 
ticular student had to derive this potential in full: however this would not be 
done if two dimensional aerodynamics formed part of a module. Some courses 
contain thin aerofoil theory which is interesting mathematically. In such a uni- 
versity, comparisons with this approximate theory could be made, but not in 
Plymouth. Next, an off-centre circle $z = \gamma + ae^{i\theta}$ is transformed into an aerofoil
from which lift is calculated. As it is derivatives of the potential $w$ that give
the flow and not $w$ itself the calculation is not difficult:

$$w(z) = U\left[(z + \gamma)e^{-i\alpha} + \frac{(a + \gamma)^2}{z + \gamma}e^{i\alpha}\right] + \frac{i\kappa}{2\pi}\ln(z + \gamma)$$

so substituting for $z$ through the Joukowski transformation gives

$$\frac{dw}{dZ} = \frac{dw/dz}{dZ/dz}, \qquad \frac{dw}{dz} = U\left[e^{-i\alpha} - \left(\frac{a + \gamma}{z + \gamma}\right)^2 e^{i\alpha}\right] + \frac{i\kappa}{2\pi(z + \gamma)}.$$

Interesting mathematical points about the flow at the singularities at $z = \pm a$
are discussed, and it is deduced that a circulation

$$\kappa = -4\pi U(a + \gamma)\sin\alpha$$

gives a flow free of singularities. The constant $\gamma$ is assumed real in this project;
the student overlooked the possibility that it might be complex. In fact a complex
value gives a more realistic aerofoil, but some of the algebra can be confusing.
Lift is approximately $\pi\rho U^2 L\sin\alpha$, where the circulation $\kappa$ is neglected
in comparison with $a$ and $L$ is the length of the wing (radius of the frisbee or
“radius” of the boomerang defined from the geometry in some way).
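The circle-to-ellipse mapping described above can be checked with a few lines of complex arithmetic. This is a minimal sketch, assuming the Joukowski transformation in the form $Z = z + c^2/z$ and a centred circle of radius $a > c$. The function names and the values $a = 2$, $c = 1$ are illustrative only. The inverse picks whichever root of the quadratic lies outside $|z| = c$, a choice made here for the round-trip test rather than one taken from the project.

```python
import cmath
import math

def joukowski(z, c):
    """Joukowski transformation Z = z + c^2 / z."""
    return z + c * c / z

def joukowski_inverse(Z, c):
    """Invert Z = z + c^2/z, returning the root that lies outside the circle |z| = c."""
    root = cmath.sqrt(Z * Z - 4.0 * c * c)
    z_plus = 0.5 * (Z + root)
    z_minus = 0.5 * (Z - root)
    # The two roots multiply to c^2, so the one of larger modulus is the exterior one.
    return z_plus if abs(z_plus) >= abs(z_minus) else z_minus

def ellipse_residual(Z, a, c):
    """X^2/(a + c^2/a)^2 + Y^2/(a - c^2/a)^2 - 1, which vanishes on the ellipse."""
    semi_major = a + c * c / a
    semi_minor = a - c * c / a
    return (Z.real / semi_major) ** 2 + (Z.imag / semi_minor) ** 2 - 1.0

a, c = 2.0, 1.0                          # illustrative values with 0 < c < a
angles = [2.0 * math.pi * k / 12 for k in range(12)]
circle = [a * cmath.exp(1j * t) for t in angles]
images = [joukowski(z, c) for z in circle]

worst_residual = max(abs(ellipse_residual(Z, a, c)) for Z in images)
worst_round_trip = max(abs(joukowski_inverse(Z, c) - z)
                       for Z, z in zip(images, circle))
print(worst_residual, worst_round_trip)  # both should be at rounding-error level
```

Letting $a$ approach $c$ shrinks the semi-minor axis $a - c^2/a$ towards zero, which is the flat plate mentioned in the student's introduction.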

The project then outlines the relationship between vortex shedding and lift 
before bringing together the mechanics and fluid dynamics to discuss the flight 
of frisbees and boomerangs. The boomerang is analysed and some attempt is 
made to understand why it returns to the thrower. The problem is a complex 
one and this part of the project is rather descriptive and not particularly
convincing. So although there is good mathematics in the project ranging over
solid and fluid mechanics, and an understanding of the practicalities of
throwing frisbees and boomerangs, the project has shortcomings in terms of knitting
the three parts together. There were also some mathematical errors and some
parts where the student fails to be as general as possible. The project topic
could be criticised as being rather too wide in scope. This is not entirely the
fault of the student, although (s)he should perhaps have sought more guidance.
The student, who was responsible and mature in attitude although not in years,
certainly enjoyed the project and gained a lot from it.

It is sometimes difficult to define the scope of a project tightly enough at 
the beginning. One problem is that the mathematical experiences of the project
supervisor at the equivalent age can be widely different from those of the student. This is
not just a generation gap. In mechanics in particular, modern students in the
UK lack the facility with algebra and geometry to get to grips quickly with 
new mechanics. This is true even for able students: it is as if the oil is missing 
from the bearings. It is all too easy to overestimate the student’s ability and 
assign an over-ambitious project. This one in retrospect did err on the side 
of being over-ambitious although not disastrously so. Both assessors were in 
agreement over the mark to award this project: they both felt that the student 
was very strong at presentation and answering questions, but that there were 
some shortcomings in the student’s understanding of the mathematical detail. 
The mark was between 60 and 65%. 



3.6 Hurricane Dynamics 

The general topic of geophysical fluid dynamics is a reasonably recent addi- 
tion to applied mathematics. Although the ocean, atmosphere and inner earth 
have been described qualitatively since the seventeenth century (the inner earth 
rather inaccurately!), it is only in the twentieth century that any substantial 
quantitative modelling has been done. Modelling in this area is a young sci- 
ence and is still very active thanks largely to the availability of computing 
power together with resources arising from public interest in the environment. 
Hurricanes are one particular kind of severe tropical storm. They are publicly 
well known but continue to be very hard to model accurately. A hurricane is a 
complex system. It is a three dimensional swirling time dependent object with 
sophisticated fluid dynamics and thermodynamics. There is still much that is 
not known about hurricanes, so to do them justice in a final year mathematics 
project is difficult.



3.6.1 Scope

Faced with the above, the student on this project needed to make some deci- 
sions. The idea to do “something about hurricanes” came from the student, so 
the desire was there but a vague idea for a project has to be honed into shape 
quickly and this is one of the roles of the supervisor. After discussions the 
student on this project decided on a particular strategy and the introduction 
consisted of a description of a simple physical model together with diagrams. 
The first chapter of the project is mathematically very elementary and contains 
only a short description of ideal fluid mechanics and a general description of 
dynamic meteorology which is not really an introduction to the project but 
more of an introduction to the subject of meteorology. Instead of quoting this, 
here we give an excerpt from this student’s chapter 2 which is an introduction 
to the hurricane. 

Figures 3.4 and 3.5 show a vertical cross section of the hurricane, which
consists of four distinct regions and may be described as follows:

• An updraft region of tall convective clouds and intense rainfall with
accompanying release of large amounts of latent heat (region III)

• A region of rapidly swirling winds corresponding to a large pressure
gradient (region II)

• A boundary layer (region I) where the swirl winds interact with the sea
surface, causing an inflow which supplies moist air to the updraft in
region III

• A warm quiescent, relatively dry core, or eye, in which there is a very slow
recirculation (region IV).

Figure 3.5 Cross section of a hurricane.

The swirling flow of region II settles very slowly into the region below, sup- 
plying the fluid, which moves radially inward in the boundary layer. There 
is almost no radial motion at all in region II and the flow is therefore ef- 
fectively frictionless. The downdraught in region II is a source to the radial 
inflow in region I which moves into region III where it flows upward and 
outward, conserving angular momentum. At any given radial position, the 
updraft fluid has a deficit in angular momentum, compared to the fluid in 
region II. This is merely due to the frictional losses to the sea surface in the 
boundary layer. 

Again note the absence of history (there was a brief sentence or two in 
the earlier chapter, but not enough to be classified as history). Instead we are 
straight into a model of a hurricane. This is typical of a project that concerns 
a model of a real situation. In the first few weeks of this particular project, the 
student was faced with two rather unpalatable facts. First it became apparent 
that the real hurricane was just too hard to model given the student’s knowledge 
and experience. There could be no rainfall, no time dependent development
and no lateral movement of the entire storm. The student was left to model
a stationary rotating very idealised storm. Even so, the second fact was that 
there was some hard background theory of rotating fluids and thermodynamics 
which had to be understood. It is easy for the student to lose heart under these 
circumstances and the supervisor has to double as a counsellor. Here then are 
the agreed bullet points that cover the tasks of this particular student project: 

• to understand geostrophic balance and thermal wind equations and solve
them,

• to understand the Ekman boundary layer theory and apply it to a model of
a hurricane,

• to be able to include enough basic thermodynamics to provide a convincing
model of a hurricane.

The student on this project was very knowledgeable about hurricanes and pos- 
sessed good background literature in terms of books about them. These texts 
contained little mathematics and were probably aimed at physical geographers. 
Dynamic meteorology texts, e.g. J. R. Holton’s An Introduction to Dynamic
Meteorology, contained much of the modelling. However these are graduate level
texts and were difficult for the student. One positive point was preparing and
delivering the talk which was easy for this student. The talk itself consisted 
of a good general introduction to hurricanes together with the mathematics of 
geostrophy and Ekman dynamics. 



3.6.2 Project Details 



The subject matter here demands a familiarity with partial differential equa- 
tions at a level a little above that normally expected of an undergraduate 
mathematics student at the start of the final year. However, the student did 
partially derive the simplified linear equations of the form

$$\frac{\partial u}{\partial t} - fv = -\frac{1}{\rho}\frac{\partial p}{\partial x} + F_x$$
$$\frac{\partial v}{\partial t} + fu = -\frac{1}{\rho}\frac{\partial p}{\partial y} + F_y$$
$$0 = -\frac{1}{\rho}\frac{\partial p}{\partial z} - g + F_z$$

the first two of which express geostrophic balance and the last of which
expresses hydrostatic balance (provided the time dependence is ignored and the
externally applied force $(F_x, F_y, F_z) = 0$). The notation, standard for
meteorology and oceanography, is as follows: local Cartesian co-ordinates $(x, y, z)$
are $x$-East, $y$-North and $z$-up with associated velocity vector $\mathbf{V} = (u, v, w)$.
($w$ does not feature as it is so small compared with $u$ and $v$.) $p$ is pressure,
$\rho$ is density (constant in the first instance), $f = 2\Omega\sin\theta$ is twice the vertical
component of the Earth’s angular velocity, usually called the Coriolis parameter ($\Omega$
is the angular velocity of the earth, $\theta$ is latitude), and of course $t$ is time. The
student learned the consequences of these linear equations. For example that 
in the Northern hemisphere wind is parallel to the isobars and travels around a 
low pressure centre (and a hurricane) anti-clockwise in the absence of friction. 
The student also realised that friction causes the wind to turn inwards towards 
the low pressure giving the characteristic spiralling pattern of clouds seen in 
satellite photographs. The natural co-ordinate system to use when modelling 
a hurricane is plane polar co-ordinates. However there are problems with this, 
for as the student soon found, unfortunately the Ekman equations (given here 
later) are insoluble in polar co-ordinates. The quantity $\boldsymbol{\zeta} = \nabla \times \mathbf{V}$ is the vorticity
which as the student soon found out is an important quantity to understand, 
especially the vertical component of this added to / which is conserved as the 
hurricane moves as a whole (the conservation of potential or absolute vorticity).
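The statement that the frictionless wind blows parallel to the isobars, anti-clockwise around a low in the Northern hemisphere, follows directly from the first two equations with the time derivatives and applied forces dropped. The short sketch below solves those two balances for $(u, v)$ given a horizontal pressure gradient. The latitude, air density and pressure gradient are illustrative assumptions, and the value used for the Earth's rotation rate is the commonly quoted $7.292 \times 10^{-5}$ rad/s.

```python
import math

EARTH_ROTATION = 7.292e-5   # angular velocity of the Earth, rad/s

def coriolis_parameter(latitude_deg):
    """f = 2 * Omega * sin(latitude)."""
    return 2.0 * EARTH_ROTATION * math.sin(math.radians(latitude_deg))

def geostrophic_wind(dpdx, dpdy, rho, f):
    """Solve -f v = -(1/rho) dp/dx and f u = -(1/rho) dp/dy for (u, v)."""
    u = -dpdy / (rho * f)
    v = dpdx / (rho * f)
    return u, v

# Illustrative point due east of a low pressure centre at 20 degrees North:
# pressure increases eastwards (away from the low) by 2 hPa per 100 km.
f = coriolis_parameter(20.0)
rho = 1.2                    # air density, kg/m^3 (assumed)
dpdx, dpdy = 2.0e-3, 0.0     # pressure gradient, Pa/m

u, v = geostrophic_wind(dpdx, dpdy, rho, f)
print(f"f = {f:.3e} 1/s, u = {u:.1f} m/s, v = {v:.1f} m/s")
```

East of the low the wind comes out northward ($v > 0$, $u = 0$), that is, anti-clockwise about the centre and at right angles to the pressure gradient, in agreement with the text.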

At the ground friction becomes important to include in a model. The most 
successful model, now nearly one hundred years old, is due to Ekman and 
assumes a boundary layer in which a constant eddy viscosity parametrises the 
complex turbulence and skin friction. This is the model that the student used. 
The first two of the above equations without the time dependence become

$$-fv = -\frac{1}{\rho}\frac{\partial p}{\partial x} + \nu\frac{\partial^2 u}{\partial z^2}$$
$$fu = -\frac{1}{\rho}\frac{\partial p}{\partial y} + \nu\frac{\partial^2 v}{\partial z^2}$$

where $\nu$ is the eddy viscosity. Given the linearity of these equations, the student
has little difficulty with the following procedure for solving them. First of all
the geostrophic part of the horizontal velocity is separated from the rest and
given the suffix $g$ so that $(u, v) = (u_g + u_E, v_g + v_E)$ and

$$-fv_E = \nu\frac{\partial^2 u_E}{\partial z^2}$$
$$fu_E = \nu\frac{\partial^2 v_E}{\partial z^2}.$$

The complex velocity $w = u_E + iv_E$ is introduced and the first equation added
to $i$ times the second to give

$$\frac{d^2 w}{dz^2} = \frac{if}{\nu}w$$

which is solved together with boundary conditions (zero velocity at the ground
and geostrophic flow above the Ekman layer) to give the following solution:

$$u = u_g + u_E = u_g(1 - e^{-\gamma z}\cos\gamma z)$$
$$v = v_g + v_E = u_g e^{-\gamma z}\sin\gamma z$$

where $\gamma = \sqrt{f/(2\nu)}$ and it has been assumed that the $x$-axis has been aligned
to the geostrophic wind (i.e. $v_g = 0$). The difficulty the student has with this
part of the project is not so much the mathematics (although there are no- 
tation problems, not reproduced here to avoid unnecessary confusion) but the 
interpretation of the physics. The formulation of a boundary value problem 
out of the assumptions made and then interpreting the solution is unfamiliar 
territory for many students. Incidentally, this Ekman model resurfaces (no pun 
intended) in Chapter 5 as an example of a case study, see Section 5.2. This 
is deliberate and enables direct comparison to be made between an individual 
project and a case study when the background topic is the same. 
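For readers who want to see the Ekman spiral rather than take it on trust, the complex form of the solution is convenient to compute with. The sketch below assumes the decaying solution $w = -u_g e^{-(1+i)\gamma z}$ with $\gamma = \sqrt{f/(2\nu)}$, which satisfies $d^2w/dz^2 = (if/\nu)w$, vanishes far above the ground and cancels the geostrophic wind at $z = 0$. The values of $u_g$, $f$ and $\nu$ are illustrative assumptions and are not taken from the project.

```python
import cmath
import math

def ekman_complex(z, u_g, f, nu):
    """Complex Ekman velocity w = u_E + i v_E at height z above the ground."""
    gamma = math.sqrt(f / (2.0 * nu))
    return -u_g * cmath.exp(-(1.0 + 1.0j) * gamma * z)

def ekman_wind(z, u_g, f, nu):
    """Total wind (u, v) with the x-axis along the geostrophic wind (v_g = 0)."""
    w = ekman_complex(z, u_g, f, nu)
    return u_g + w.real, w.imag

# Illustrative values (assumptions, not taken from the project)
u_g = 10.0      # geostrophic wind speed, m/s
f = 5.0e-5      # Coriolis parameter, 1/s (roughly 20 degrees North)
nu = 10.0       # eddy viscosity, m^2/s

for height in (0.0, 100.0, 300.0, 1000.0, 5000.0):
    u, v = ekman_wind(height, u_g, f, nu)
    print(f"z = {height:6.0f} m   u = {u:6.2f} m/s   v = {v:6.2f} m/s")
```

Near the ground $v$ is positive. With the geostrophic wind along $x$ in the Northern hemisphere the low pressure lies in the positive $y$ direction, so this is the frictional turning of the wind towards the low described earlier.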

The above is mathematically tidy. However, modelling hurricanes is not 
done using neat mathematics and the rest of the report wrestles with some 
mathematical models which do not have tidy solutions. The student found 
immediately that if one included the curvature of the isobars of a hurricane 
this produces an extra term $-v^2/r$ on the left hand side of the $x$ equation
which stops the polar version of the Ekman equations from being solved exactly. 
The student therefore took the Cartesian solution (Ekman spiral) and inferred 
correctly that a rotary geostrophic motion must cause inflow towards the eye. 
From this a general picture begins to emerge, although there is much work to 
be done before anything like a workable model of a hurricane takes shape. The 
middle chapters of the project are devoted to atmospheric thermodynamics and 
a description of the fluid flow in the eye of the hurricane. The first and second 
laws of thermodynamics are given in mathematical form and the project here 
looks sophisticated. For this description we omit the details, but for those not 
versed in the subject its principal complexity lies in the plethora of symbols and 
definitions as well as keeping track of what is constant and what is varying when 
the partial derivatives are calculated. Here is one example from the project. The 
first law of thermodynamics in a meteorological context can be expressed as 



$$\theta = T\left(\frac{p_0}{p}\right)^{R_d/c_p}$$

where $\theta$ is the potential temperature, $T$ is the temperature, $R_d$ is the gas
constant for dry air and $c_p$ is the specific heat for dry air at constant pressure.
It can be seen that this is assuming dry air and ideal gas laws. It can also be 
seen that there are terms that need definition, and these definitions in turn lead 
to the introduction of new concepts. The student was faced with a very real
problem of how much to include. In the end, not enough explanation was given
to the liking of one of the assessors and the student lost marks because of this. 
This student probably included too much mathematical detail and gave the 
assessors too much rope with which to hang the unfortunate undergraduate. 
The next bit of mathematics shows this. The expression 

$$c_p T\ln\left(\frac{\theta_{es}}{\theta}\right) = L_c q_{es}$$

was derived from the last equation courtesy of three definitions, saturation 
equivalent potential temperature, the latent heat of condensation and the sat- 
uration mixing ratio. This equation is then differentiated and the saturated 
moist entropy S is defined which enables the expression 

$$c_p T\,d(\ln\theta_{es}) \approx T\,dS$$

to be derived. There is then much more partial differentiation and introduction 
of definitions, all of which are correct, but not all of which seem to have been 
understood by the student. 
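A small calculation can make the potential temperature less abstract. The sketch below assumes the standard dry-air definition $\theta = T(p_0/p)^{R_d/c_p}$, with a reference pressure $p_0 = 1000$ hPa and the commonly quoted values $R_d \approx 287$ and $c_p \approx 1004$ J kg$^{-1}$ K$^{-1}$. These constants and the function name are assumptions introduced here, not values taken from the project.

```python
R_D = 287.0      # gas constant for dry air, J kg^-1 K^-1 (commonly quoted value)
C_P = 1004.0     # specific heat of dry air at constant pressure, J kg^-1 K^-1
P_0 = 1000.0     # reference pressure, hPa (assumed)

def potential_temperature(temperature, pressure, reference_pressure=P_0):
    """Temperature (K) a dry parcel would have if brought adiabatically to p_0."""
    return temperature * (reference_pressure / pressure) ** (R_D / C_P)

# A parcel at 500 hPa and 250 K, and one already at the reference pressure.
print(potential_temperature(250.0, 500.0))
print(potential_temperature(290.0, 1000.0))
```

A parcel already at the reference pressure has $\theta = T$, while the cold parcel aloft has a potential temperature well above its actual temperature, which is why $\theta$ rather than $T$ is the useful quantity when comparing air at different heights.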

Combining this complicated thermodynamics with the hydrodynamics of 
a swirling storm was then attempted with partial success. The last chapter 
re-introduced the time dependent terms in the Ekman equations and Laplace 
transforms were used to solve them. The solution to the time dependent prob- 
lem in s-space (where s is the Laplace transform variable) is given as 

$$\Phi = \frac{\cdots}{s\sqrt{s + if}}$$

where $\Phi$ is the Laplace transform of the complex velocity $u + iv$. The inversion
of this was not possible, but final and initial value theorems were used to
good effect to deduce the time dependent behaviour and the student re-derived 
some results which had only previously been given in integral equation form
(a Fredholm integral equation of the first kind) by use of the convolution integral. At this
point this project showed some similarities to the boomerang project of the 
last section in that the pieces did not hang well together. Some differences of 
opinion between assessors also surfaced. The supervisor realised the difficulty 
of the project and wanted to award a mark just over 60%. However, the sec- 
ond assessor was less impressed. All he saw was a rather disjointed and bitty 
project, albeit on a difficult subject, but crucially during the oral examination 
the student seemed unable to answer any penetrating questions. This left the
second assessor with the impression that much of what was written was not at 
all well understood. Eventually a compromise mark of 56% was reached. This 
student was rather disappointed with this mark, but the lesson is as follows:
better to reduce the scope of the project and really understand all that is
written than be too ambitious, even when deep personal interest makes the latter
extremely tempting.

These lessons seem to be well understood by the student who took on the 
following project. 



3.7 Hypergeometric Functions 

Hypergeometric functions are solutions to a second order differential equation. 
They used to feature widely in the final stages of mathematics courses, but 
do so rarely these days having largely been displaced by numerical methods. 
However, students who enjoy solving differential equations see the hypergeo- 
metric equation as bringing together and generalising the solutions to more 
specific differential equations. Most students meet Bessel functions and Legen- 
dre functions, and solve these using series (the Frobenius method). Classifying 
the singularities of a differential equation and using the Riemann P function 
also feature in many degree courses. The student who pursued this project was 
particularly keen and quite happy to read and work through long passages of 
the classical work Ordinary Differential Equations by E. L. Ince originally
published in 1926. The project starts with defining the hypergeometric equation

$$x(1 - x)\frac{d^2 y}{dx^2} + \{\gamma - (1 + \alpha + \beta)x\}\frac{dy}{dx} - \alpha\beta y = 0,$$

finding and classifying the singularities of the equation then expressing the
solution in the form of a Riemann P function for the hypergeometric function
as follows:

$$y = P\begin{Bmatrix} 0 & \infty & 1 & \\ 0 & \alpha & 0 & x \\ 1 - \gamma & \beta & \gamma - \alpha - \beta & \end{Bmatrix}.$$

In order to calculate the Frobenius series solution which was an application of 
a known method, the student had to know about and use Gamma and Beta 
functions. The series was then obtained. Special cases and different forms of the 
hypergeometric function were explored including the use of complex analysis. 
In the final chapter contour integral representations were found. Throughout 
the project, the technical level of the mathematics was high and there are 
several pages of complicated algebra that contain virtually no English words 
other than “therefore” and “hence”! However, the mathematics here is not new
and the results are all to be found in texts on special functions or advanced 
differential equations. Indeed, the student found one or two very specialist texts 
devoted entirely to the generalised hypergeometric function. This project was
technically demanding but the material was freely available. The scope of the
project was not difficult to define. The student could stop after a particular task 
(say finding the series solution to the generalised hypergeometric function) or 
go on to something new (integral representation). Provided the student was 
technically competent to deal with the material and enjoyed tackling pages of 
algebra, this was a project that could earn high marks. 



3.7.1 Scope 

In a topic such as this, some history is certainly possible, but this student de- 
cided on a technical approach preferring to give some non-historical background 
on differential equations. Here is an excerpt from the introduction: 

The hypergeometric equation is a second order ordinary differential equation 
of a special kind. It can be shown, for example, that any second order ODE 
with regular singularities (see definition, Chapter 2) one of which is at infinity 
can be transformed into a hypergeometric equation. We show that this is so 
in some particular cases, but begin in Chapter 2 with finding the Frobenius 
series solution to the equation, generalising the series and examining the fact 
that special values substituted into the series yield some familiar elementary 
functions. 

By merging two of the singularities in a certain way we obtain what is called
the confluent hypergeometric function. This is another species of hypergeometric
function which can be further transformed into Whittaker’s confluent
hypergeometric function in the form $w''(x) + p(x)w(x) = 0$, and also to the
more familiar Bessel’s equation.

It is not difficult to see from this short excerpt that the project soon gets 
into some intense mathematics. One can almost detect the impatience of the 
student to dispense with the words and get down to serious mathematics! It is 
important to stress to such students the vital role the individual project plays 
in providing a vehicle for the student to express complicated mathematics to 
a lay audience. This student found this particular aspect hard (although after 
graduating (s)he found work in the financial sector and is doing very well). The 
following bullet points were decided upon to define the scope of the project: 

• to define and find the singular points and series for the solution to the
  hypergeometric equation,

• to relate special cases to well known elementary functions with proofs,

• to define the confluent hypergeometric function and examine its properties,

• to find integral representations of the hypergeometric function.

One avenue that was possible but not explored was to pursue a numerical
approach and use computer programming to evaluate hypergeometric functions. 
One imagines this as a whole new project possibly suited to a student on a joint 
mathematics and computing degree. 

As already mentioned there are plenty of texts available on hypergeometric 
functions, starting with the classical one Modern Analysis by Whittaker and 
Watson first published in 1902. The student found many texts including two 
entirely devoted to hypergeometric functions. The topic of the talk was a sample 
proof of a result in the project together with an overview of hypergeometric 
functions. At the time of the talk (halfway through the project) the student 
was not sure how much would be done, so there was a vagueness about the 
scope. The talk was technically good if a little dry. Also some students in the 
audience were a little overwhelmed by the mathematical detail. 



3.7.2 Project Details 



Of all the project examples given in this chapter, this one contains the most 
equations and the least writing, so to get a flavour it is necessary to plunge 
straight away into mathematical detail. The series solution to the hypergeo- 
metric equation is tedious but reasonably straightforward. One substitutes the 
series

$$y = \sum_{n=0}^{\infty} a_n x^{n+r}$$

into the differential equation

$$x(1-x)\frac{d^2y}{dx^2} + [\gamma - (1+\alpha+\beta)x]\frac{dy}{dx} - \alpha\beta y = 0$$

and equates powers of x. The lowest power of x leads to the indicial equation 

$$r(r-1)a_0 + \gamma r a_0 = 0$$

from which $r = 0$ or $r = 1 - \gamma$. Straight away there is a special case $\gamma = 1$ to
consider. The coefficient of a general power of $x$ leads to the recurrence

$$a_n = \frac{r^2 + n^2 - 2r - 2n + 2rn + 1 + \alpha r + \alpha n - \alpha + \beta r + \beta n - \beta + \alpha\beta}{(r+n)(r+\gamma+n-1)}\,a_{n-1}.$$



To get to this requires stamina, accuracy and knowhow to spot special cases 
for the parameters $\alpha$, $\beta$ and $\gamma$. All of which means mathematical ability. The
digestion of all the algebra is considerably eased by the introduction of the
Pochhammer symbol $(\alpha)_n$ defined by

$$(\alpha)_n = \frac{\Gamma(\alpha+n)}{\Gamma(\alpha)} = \alpha(\alpha+1)(\alpha+2)\cdots(\alpha+n-1).$$
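
As a sketch of how the symbol tidies things up (an illustration of the step, not a quotation from the project): the numerator of the recurrence is the expansion of $(r+n-1+\alpha)(r+n-1+\beta)$, so for the root $r = 0$ the recurrence telescopes into Pochhammer symbols.

```latex
a_n = \frac{(\alpha+n-1)(\beta+n-1)}{n\,(\gamma+n-1)}\,a_{n-1}
\quad\Longrightarrow\quad
a_n = \frac{(\alpha)_n\,(\beta)_n}{(\gamma)_n\,n!}\,a_0 .
```

Each factor in the numerator and denominator simply advances by one as $n$ increases, which is exactly the pattern the Pochhammer symbol records.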

The student goes on to derive the series corresponding to the root $r = 0$, taking
$a_0 = 1$, as

$$y = \sum_{n=0}^{\infty} \frac{\Gamma(n+\alpha)\,\Gamma(n+\beta)\,\Gamma(\gamma)}{\Gamma(n+\gamma)\,\Gamma(\alpha)\,\Gamma(\beta)\,n!}\,x^n = {}_2F_1(\alpha,\beta;\gamma;x)$$

using the standard notation ${}_2F_1(\alpha,\beta;\gamma;x)$ for the hypergeometric function.
The second solution corresponding to $r = 1 - \gamma$ is derived similarly:



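$$y_2(x) = x^{1-\gamma}\,{}_2F_1(\alpha-\gamma+1, \beta-\gamma+1; 2-\gamma; x) = \sum_{n=0}^{\infty} \frac{(\alpha-\gamma)_{n+1}\,(\beta-\gamma)_{n+1}}{(-\gamma)_{n+2}\,n!}\,x^{n-\gamma+1}$$

(The two forms agree up to a constant factor, which is immaterial for a solution of a linear equation.)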



Obviously when $\gamma = 1$ the two solutions are no longer independent and a second
solution has to be sought through letting

$$y_2(x) = y_1(x)\ln(x) + \sum_{n=0}^{\infty} c_n x^n$$

where $y_1(x) = {}_2F_1(\alpha,\beta;\gamma;x)$ is the first solution. This is standard procedure in
the Frobenius method but still leads to lengthy algebra which needs to be done 
accurately. The books are content with “it can be shown that. . .” so copying a 
textbook is not an easy option. 

There are proofs of the following relations: 



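$${}_2F_1(-N, c; c; -x) = (1+x)^N,$$

$${}_2F_1\left(\tfrac{1}{2}, 1; \tfrac{3}{2}; -x^2\right) = \frac{\tan^{-1}x}{x},$$

$$\lim_{c\to\infty} {}_2F_1\left(1, c; 1; \frac{x}{c}\right) = e^x,$$

and

$$x\,{}_2F_1(1, 1; 2; -x) = \ln(1+x).$$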
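
Relations of this kind are easy to spot-check numerically by summing the series term by term. The following is a minimal sketch, assuming $|x| < 1$ so that the series converges; the function name `hyp2f1` and the sample values are illustrative choices, not taken from the project.

```python
import math


def hyp2f1(a, b, c, x, terms=2000):
    """Partial sum of the Gauss series for 2F1(a, b; c; x), intended for |x| < 1."""
    total = 0.0
    term = 1.0  # the n = 0 term, (a)_0 (b)_0 / ((c)_0 0!) = 1
    for n in range(terms):
        total += term
        # stop once the terms are negligible, or the series has terminated
        if abs(term) <= 1e-17 * abs(total):
            break
        # ratio of consecutive terms: (a + n)(b + n) x / ((c + n)(n + 1))
        term *= (a + n) * (b + n) * x / ((c + n) * (n + 1))
    return total


if __name__ == "__main__":
    x = 0.3
    # each line prints the series value next to the closed form it should match
    print(hyp2f1(-4, 2.5, 2.5, -x), (1 + x) ** 4)
    print(hyp2f1(0.5, 1, 1.5, -x * x), math.atan(x) / x)
    print(x * hyp2f1(1, 1, 2, -x), math.log(1 + x))
    print(hyp2f1(1, 1e6, 1, x / 1e6), math.exp(x))  # large c stands in for the limit
```

A check like this proves nothing, of course, but it catches slips in sign or parameter order before pages of algebra are built on them.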



The student obviously delighted in spending much time on the details of de- 
riving these results, but they are straightforward. Less so is to show that Leg- 
endre’s differential equation has solution 

{x — l)'^^{A2Fi{ ^ ^ — ; 2> + B2Fi{ — ^ h 1, /i — 1/ -P 1; x)) 



which the student also derived. The project also contains some reasonably 
genuine applied mathematics. There is a solution to the non-linear pendulum 
problem whereby the equation 

$$\frac{d^2\theta}{dt^2} + \frac{g}{L}\sin\theta = 0$$

is derived, where $L$ is the length of the pendulum and $g = 9.81\,\mathrm{m\,s^{-2}}$. It is
integrated once without difficulty and the solution is written

$$\left(\frac{2g}{L}\right)^{1/2} t = \int \frac{d\theta}{(\cos\theta - \cos\theta_0)^{1/2}}$$

which is expressed in terms of the elliptic integral of the first kind 



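$$K(k) = \int_0^{\pi/2} \frac{d\phi}{\sqrt{1 - k^2\sin^2\phi}}.$$

It is shown in the project that this can be written

$$K(k) = \frac{\pi}{2}\,{}_2F_1\left(\tfrac{1}{2}, \tfrac{1}{2}; 1; k^2\right)$$

and the period of oscillation is

$$T = 2\pi\sqrt{\frac{L}{g}}\;{}_2F_1\left(\tfrac{1}{2}, \tfrac{1}{2}; 1; \sin^2\frac{\theta_0}{2}\right)$$

which becomes

$$T = 2\pi\sqrt{\frac{L}{g}}$$
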
when $\theta_0$ is small as expected. The second piece of applied mathematics comes
via the confluent hypergeometric equation. In the original full hypergeometric
equation $x$ is replaced by $x/\beta$, the equation is divided by $\beta$ and $\beta$ is
allowed to tend to infinity. This has the effect of merging the two regular
singularities at $1$ and $\infty$ to a single irregular singularity at $\infty$. The
differential equation becomes the simpler looking

$$x\frac{d^2y}{dx^2} + (\gamma - x)\frac{dy}{dx} - \alpha y = 0.$$



The solution of this is written 



$${}_1F_1(\alpha; \gamma; x) = \sum_{n=0}^{\infty} \frac{(\alpha)_n}{(\gamma)_n\,n!}\,x^n.$$



This is derived in the project as is the solution to Bessel’s equation 



$$x^2\frac{d^2u}{dx^2} + x\frac{du}{dx} + (x^2 - n^2)u = 0$$

in the form

$$u(x) = Ae^{-ix}x^{n}\,{}_1F_1\left(n + \tfrac{1}{2}; 2n + 1; 2ix\right) + Be^{-ix}x^{-n}\,{}_1F_1\left(-n + \tfrac{1}{2}; -2n + 1; 2ix\right),$$
and a special polynomial solution to Laguerre’s equation

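$$x\frac{d^2y}{dx^2} + (1 - x)\frac{dy}{dx} + Ny = 0$$

in the form

$$L_N(x) = {}_1F_1(-N; 1; x) = \sum_{n=0}^{N} \frac{(-N)_n}{(n!)^2}\,x^n$$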

where N is an integer. It is these Laguerre polynomials that feature in math- 
ematical models of edge waves in the ocean. This too was mentioned in the 
project but without a lot of detail. In a project such as this where there is an 
emphasis on mathematical methods it is good to at least nod in the direction 
of some applications. Another route could have been to relate hypergeometric 
functions to the solution of Mathieu’s equation which is found both in jet en- 
gine outflow modelling and the modelling of sub-harmonic oscillations, but this 
was not done here. 
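
The pendulum application lends itself to a quick numerical illustration. The sketch below assumes the standard result that the period is $2\pi\sqrt{L/g}$ multiplied by ${}_2F_1(\tfrac{1}{2}, \tfrac{1}{2}; 1; \sin^2(\theta_0/2))$, with $\theta_0$ the amplitude of the swing; the function names are hypothetical and the series is simply summed until its terms are negligible, which becomes slow as $\theta_0$ approaches $\pi$.

```python
import math


def hyp2f1(a, b, c, x, terms=2000):
    """Partial sum of the Gauss series for 2F1(a, b; c; x), intended for |x| < 1."""
    total = 0.0
    term = 1.0
    for n in range(terms):
        total += term
        if abs(term) <= 1e-17 * abs(total):
            break  # remaining terms no longer change the sum
        term *= (a + n) * (b + n) * x / ((c + n) * (n + 1))
    return total


def pendulum_period(length, theta0, g=9.81):
    """Period (s) of a pendulum of given length (m) swinging with amplitude theta0 (rad)."""
    k_squared = math.sin(theta0 / 2) ** 2
    small_angle_period = 2 * math.pi * math.sqrt(length / g)
    return small_angle_period * hyp2f1(0.5, 0.5, 1.0, k_squared)


if __name__ == "__main__":
    for degrees in (1, 30, 90, 150):
        period = pendulum_period(1.0, math.radians(degrees))
        print(f"amplitude {degrees:3d} degrees: period {period:.6f} s")
```

Running it for increasing amplitudes shows the period creeping above the small-angle value, which is the physical content of the hypergeometric correction factor.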

The final chapter of the project contained the derivation of integral repre- 
sentations of the hypergeometric functions. The student knew complex variable 
theory up to the residue calculus, but had not met branch points before. The 
properties of these had to be learned, for example using branch cuts to render
a multivalued function single valued on the cut plane. There were more pages of mathematics, this
time complex contour integration with branch cuts and small circles the radii 
of which are made infinitesimally small before the result 



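$${}_2F_1(\alpha,\beta;\gamma;z) = \frac{\Gamma(\gamma)}{2\pi i\,\Gamma(\alpha)\Gamma(\beta)} \int_{-i\infty}^{i\infty} \frac{\Gamma(\alpha+s)\,\Gamma(\beta+s)\,\Gamma(-s)}{\Gamma(\gamma+s)}\,(-z)^s\,ds$$

is derived and shown to be convergent for $|z| < 1$ and $|\arg(-z)| < \pi$. The
contour is a semi-circle in the complex $s$ plane and (of course) $z$ is an arbitrary
complex number.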

In a project such as this, it is important for the student to be analytically 
correct. The convergence of infinite integrals, the range of differentiability and 
continuity of functions need to be stated, and the student who wrote this project 
was careful in this regard. What emerged here was a carefully and accurately 
done project which contained difficult mathematics. All the mathematics (apart 
from the edge wave application barely mentioned) was nineteenth century but 
a great deal of it was new to the student. The topic had the advantage of 
being “well known” in the sense of being embedded in the literature. Although 
some literature sources were difficult to follow there was no question as to 
the correctness of the mathematics. The student had to work harder to explain 
what (s)he was doing to an intelligent lay audience. If the project as a whole had 
faults, they lay in this direction. The assessors were impressed with the project 
report, and the student performed very well in the oral examination, answering 
all questions clearly, confidently and, most important, correctly. Overall the 
student did very well on this project, and as you may have already guessed was
unanimously awarded a clear first. 



3.8 Summary 

When mathematics projects first started in the early 1970s in the UK, it was 
thought they would be on topics such as “ten ways to calculate π”. The hypergeometric
project here is a direct legacy of this kind of project in mathematics.
It is a skills based project of the type commonly found in mathematics, en- 
gineering and the physical sciences, but absent from the social and political 
sciences and business studies. The other three are in a sense more mature in 
structure. The first is really embedded in the history of mathematics, geometry 
in particular, and the student has the opportunity to be scholarly, read around 
the subject and the end result could almost be publishable. The boomerang 
project was about modelling the behaviour of a device. This meant selecting the 
appropriate mathematics from the student’s store and modelling the motion. 
This is classical mathematical modelling with the emphasis being on the model 
building and less on verification. The hurricane project is more difficult. It at- 
tempts to describe mathematically a natural phenomenon. This is harder than 
describing boomerangs for several reasons, not least because hurricanes can- 
not be controlled. Again the emphasis is on building the mathematical model 
of the hurricane rather than on verifying its accuracy precisely. In both of 
these projects, the students knew the shortcomings of their models. They both 
learned the value of mathematics to model real world phenomena which is a 
definite plus. The geometry project gave the student the opportunity to write 
in a scholarly fashion about a highly mathematical topic. This too is a plus. 
The more traditional hypergeometric equation project enabled the student to 
hone mathematical skills (algebraic manipulation and analysis) to a substantial 
depth and a depth not possible in taught modules, and this is a plus. All the 
projects were therefore successful. There are many other project topics that 
might by many be considered distinctive and unlike those outlined here, but 
I would disagree. Only a project based on computing might perhaps be dif- 
ferent. If it involved solving a real problem numerically it would resemble the 
boomerang project in form, and if it were more “pure” computing (logic or 
formal methods) it could resemble the hypergeometric equation project. There 
may be a hybrid (encryption perhaps) that could have different features, but 
these features would not be significantly distinctive. There is therefore certainly 
enough material in this chapter to help to see how a range of individual projects 
in mathematics operate. 

The assessment of individual projects was outlined in Chapter 2 alongside
assessment in general. The main part of the assessment has to be the project 
write up. It is normal for this to be say 80% of the marks with 10% each for 
the oral assessment (viva-voce or alternative) at the end and a further 10% 
for the verbal presentation midway through. Sometimes an interim report is 
asked for (see Chapter 4) and marks are assigned to this. This report should be 
short, less than 5 pages, and give an indication of what has been achieved so 
far and a view of what is to come. The 80% of marks for the written project is 
often subdivided further either explicitly or in the form of guidelines. I prefer 
the latter for maximum flexibility. The project write up should be assessed for 
content and presentation. Content is principally the level and accuracy of the 
mathematics, but is also the correctness and standard of the written English. 
Presentation covers the layout, readability, literary style and general appear- 
ance of the finished article. The assessors, normally two in number, usually 
meet once all the various bits of assessment are assembled and then they agree 
over the final mark. If there is a dispute, a third assessor needs to act as referee. 
In my experience, where there have been disputes it is almost always the super- 
visor doubling as first assessor being more generous than the second assessor 
and again almost always a compromise is quickly reached, usually because the 
second assessor is made more aware of some of the circumstances. 

Normally a chapter in a mathematics text ends with a set of exercises. This 
cannot be done sensibly here so instead a list of possible project titles is given 
together with what hopefully are useful comments. 



3.9 Project Examples 

In all the projects that follow, a student has successfully followed through an 
investigation and has gained an honours degree either in Mathematics, or with 
mathematics as the major subject (i.e. forming two thirds of the programme of
study). No statistics or statistics related projects have been listed. 

The Numerical Solution of Ordinary Differential Equations using 
Adaptive Step-length Techniques 

This project is based squarely in numerical methods and extends the student’s 
experience by solving ODEs using methods not met in the classroom. The 
project can contain a review of those techniques met in class such as Runge- 
Kutta and some multi-step methods, but will move quickly on to other multi- 
step methods and the use of a variable step-length to control the truncation 
error. There is also the opportunity to write computer programs, or to understand
and perhaps adapt existing software.
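
To give a flavour of what a variable step-length means in practice, here is a minimal sketch using step doubling with the classical fourth order Runge-Kutta method: each step is taken once with length h and again as two steps of length h/2, and the difference between the two answers estimates the local truncation error. This is one simple technique among several, not necessarily the one used in the project, and the function names, safety factor 0.9 and growth limits are illustrative assumptions.

```python
def rk4_step(f, t, y, h):
    """One classical fourth order Runge-Kutta step for the scalar ODE y' = f(t, y)."""
    k1 = f(t, y)
    k2 = f(t + h / 2, y + h / 2 * k1)
    k3 = f(t + h / 2, y + h / 2 * k2)
    k4 = f(t + h, y + h * k3)
    return y + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)


def solve_adaptive(f, t0, y0, t_end, tol=1e-8, h=0.1):
    """Integrate y' = f(t, y) from t0 to t_end, adapting h by step doubling."""
    t, y = t0, y0
    accepted = 0
    while t < t_end:
        h = min(h, t_end - t)  # do not step past the end point
        one_big = rk4_step(f, t, y, h)
        half = rk4_step(f, t, y, h / 2)
        two_small = rk4_step(f, t + h / 2, half, h / 2)
        # Richardson estimate of the local error in the two-half-step value
        err = abs(two_small - one_big) / 15
        if err <= tol:
            t, y = t + h, two_small
            accepted += 1
        # the power 1/5 reflects the O(h^5) local error of a fourth order method
        factor = 0.9 * (tol / err) ** 0.2 if err > 0 else 2.0
        h *= min(2.0, max(0.1, factor))
    return y, accepted


if __name__ == "__main__":
    value, steps = solve_adaptive(lambda t, y: y, 0.0, 1.0, 1.0, tol=1e-10)
    print(value, steps)  # value should be close to e
```

A rejected step costs three function sweeps for nothing, which is why methods with embedded error estimates are usually preferred in production software; the step-doubling version is simply the easiest to understand.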

Set Theory 

Here is an example of a project based in Pure Mathematics. This project concerns
ZFS (Zermelo, Fraenkel and Skolem). It needs to contain a discussion of
ordinal and cardinal numbers, but ultimately it is about the Axiom of Choice: 
given any collection of non-empty sets A, B, C, . . . there exists a set consisting 
of exactly one member from each of the sets A, B, C, . . .. Of course, the con- 
tent and structure of this and any of the projects mentioned here will depend 
on the scope of the modules the student is taking. 

Braid Groups 

Here is another Pure Mathematics project. However it is not as pure as the one 
above. Braids are topological objects linked (no pun intended) to knot theory 
but with applications to areas of geometry, dynamical systems and physics. There
are many possible projects, but the most natural direction to take would be an 
extension to a final year pure mathematics module on topology looking at the 
group structure and equivalence classes (Artin’s Theorem). There is plenty of 
opportunity to prove relatively simple theorems in a subject that lends itself 
to practical visualisation. 

Population Dynamics in Moths 

As the title implies, this is a project based around solving a set of differen- 
tial equations that emerge from a population model. There could be material 
on the stability or otherwise of such sets of equations, but only if they are not 
included in the course. Extensions to Liapunov criteria and Hopf bifurcation 
can be made, and there is the opportunity to validate the model using real 
data. (If there is data on other populations, perhaps the Canadian hare and 
lynx, then the title of the project can be adapted accordingly). 

Fundamental Concepts in Quantum Computing 

This is a relatively new subject, but may provide the basis for efficient compu- 
tational algorithms and unbreakable codes. However, the student needs to start 
by understanding the Dirac notation and relating quantum concepts such as 
spin to logic gates. If the student has no experience of quantum mechanics and 
no experience of logic then perhaps this is not for him/her. The potential of 
quantum computing to save vast amounts of money by making networks much
more efficient is tempting big companies to invest heavily into research. It is 
therefore a good area for students to get into. 

The Mathematics of Sound Wave Motion 

This project derives the wave equation for sound waves and investigates its 
properties. If there is a module on waves on strings, then the derivation can be 
omitted, but there is still plenty of scope to look at modification due to porosity,
incomplete reflections, etc. Investigation into cylindrical waves and spherical
waves leads to Bessel and Legendre functions, so the student can play fun and
games with these. Another possibility is the use of mathematical software such 
as Maple to display the waves, especially the three dimensional waves. This 
topic is large and it is not difficult to think of many projects. 

The Three Classical Problems 

These problems are: doubling the cube; squaring the circle; trisecting the angle. 
This project suits a student who likes a well defined project and likes more or 
less to be told what to do. Having said that however, there is scope here for a 
lot of abstract algebra in the form of extension fields. The student can prove 
various results and provide examples and proofs before delivering the coup de
grâce by using extension fields to show that all three problems are impossible
to solve. This project has to be done from the point of view of rigorous mathematics
of course: a cynical applied mathematician could always draw a square
of side $\sqrt{\pi}$ to duplicate the area of a unit circle, or manufacture a cube of side
$\sqrt[3]{2}$ to double the volume of the unit cube!

The Quark Model 

This project is most definitely theoretical physics. The Lie group has to be 
introduced, and hence some background on groups is essential. Representations
are defined and the group $SU(2)$ shown to be useful, but not adequate
and $SU(3)$ is required to represent the quark. This project requires the facility
to work with definitions and mappings as well as some manipulative ability
with $3 \times 3$ matrices. It is the sort of project that may tempt the weak student
because it sounds interesting, but the temptation should be resisted. 

Trapped Waves 



This project involves solving the linear shallow water equations with rotation 
in order to find the special waves that are trapped to travel along a coast.
The Kelvin wave is the classical solution, although it is also possible to de- 
rive equations that describe the Continental Shelf Wave. These latter waves do 
not need a free surface elevation but demand a depth change. Relating Kelvin 
waves to tidal oscillations in specific geographical locations such as the North 
Sea can be a rewarding validation experience for the student. This project de- 
mands knowledge of inviscid fluid mechanics and some elementary differential 
equation theory. However, by far the most challenging aspect of all projects 
that try to map the real world is the modelling: what to leave in and what to 
delete from all the choices. For final year undergraduates, almost always it is 
the supervisor that has to make the choice. 

Bessel Functions 

This is another project which is very tightly defined. The student is given 
Watson’s big book on Bessel Functions and told to go away and understand 
it! Well, not quite, but series solutions, special cases, Fourier-Bessel series and 
integral representations can all get an airing within such a project. It is more 
than likely that Bessel’s equation has been met on the course in one form or 
another, in which case duplication must be avoided. The use of complex anal- 
ysis to explore the properties of the equation and the different kinds of Bessel 
function is another profitable avenue for some students to take. As well as the 
standard $J_\nu(z)$ and $Y_\nu(z)$ there are the modified Bessel functions $I_\nu(z)$ and
$K_\nu(z)$ and the Hankel function $H_\nu^{(1)} = J_\nu + iY_\nu$. Then there are the spherical
Bessel functions and the Kelvin functions ber(z) and bei(z), ker(z) and kei(z), and there is
the Whittaker function and links to the confluent hypergeometric function to 
explore. Plenty of scope here, but the student must be prepared to do some 
manipulative algebra and calculus. It resembles the hypergeometric function 
project outlined in Section 3.7. 

The Magnetic Field of the Milky Way Galaxy 

This project is undeniably astronomy, and obviously suits the student whose 
interests lie in this direction. Some knowledge of astrophysics is certainly de- 
sirable in order to understand what the field is, as well as concepts such as 
the polarisation of starlight and Faraday rotation. However, the actual math- 
ematics involved needs to be little more than trigonometry, and a little bit of 
mechanics. The hard part is understanding what this represents which makes 
this more of an astronomy project than a mathematics project. As long as there 
is agreement about this (perhaps the student is studying for joint honours) then 
there should be no problem. 

Let us finish by listing some further project titles that have been successfully
pursued. The first two are joint Mathematics with Education student projects 
and the same comments as made above about astronomy would apply. 



Table 3.2. A project list

Title

An investigation into the implementation of the framework for teaching mathematics at key stage 3 (age 12-14)
Using case study methodology to determine what MTutor © must provide for student learning to be successful
An investigation into the implementation of the National Numeracy Strategy
The four colour problem
Rotating fluids
Protostellar genesis
The mathematical modelling of mechanical systems
Flow of a Newtonian fluid in an L-shaped channel using separation of variables and matching techniques
An introduction to chaos theory through a simple physical example
The Klein correspondence between twistor theory and space-time
Wave power devices
Dimensional regularisation in quantum field theory
Knot tabulation
The conservation of planetary vorticity and its applications

We have seen in this chapter how a wide variety of interesting topics are
suitable vehicles for a project in mathematics.

Individual projects may still be a novelty to some professional academic
mathematicians. Group projects in mathematics, the subject of the next chap- 
ter, are likely to be new to everyone. 




4 

Group Projects 



4.1 Introduction 

One distinctive feature of mathematics is that at undergraduate level there is 
virtually no opportunity for debates about different approaches or differences 
of opinion. This contrasts with most other subjects where the opportunity 
for debate is often central to the development of the student. For example, 
debating is the very essence of political science, and in the arts the ability 
of a student to be able to voice an opinion and back it up with argument 
is extremely important. Even in the sciences, although differences of opinion 
are rarer, there is always the opportunity to work together in the laboratory 
or to co-operate and work as a team towards a single goal as part of a large 
project. In the real world, one thinks of large teams working towards better 
treatment or even a cure for cancer and other dreadful ailments. The major 
achievements such as landing a man (person these days!) on the moon and 
returning him safely to earth, or being the first to climb Everest, or developing 
internet technology are all examples of successful team work. As discussion and 
debate are two of the primary objectives of group work one may legitimately 
ask what place this has in mathematics. Let us try to answer this. First of all 
there are undoubted social benefits. In a group, there can be a meeting of minds 
about a particular mathematical topic and this can help students learn from 
other students. A group can increase the self esteem of member students by 
helping when they get stuck on problems or by being sympathetic and otherwise 
supportive. Group work can really help the development of communication 
skills such as presenting an argument, conducting a meeting, learning to listen
and respond, etc. All of these are key skills which mathematicians (amongst 
others) have been criticised for lacking on graduation. The acquisition of key 
skills is now very much a part of all education. These points hopefully provide 
reasons to try out one particular form of group work, the group project. 

A group project consists of a group of two to six students working together 
to produce a piece of work which, as was said above, in mathematics is unusual. 
Of course, of all the disciplines, it is recognised that in mathematics it is possible 
to produce excellent work entirely on one’s own. Some suggest that the best 
mathematics is produced by isolated individuals, but although this might be 
true for the most mathematically gifted, at undergraduate level most students 
benefit by not working in isolation. Even those students pursuing the individual 
project, the subject of the last chapter, are really doing so alongside their 
project supervisor as mentor and not by themselves. 

In this chapter, some examples of group projects are analysed. The proce- 
dure of setting up group projects is outlined as well as management issues. Once 
this has been done, there are four examples of group projects. These examples 
are used to bring out what individual students do as well as the interplay (or 
otherwise!) between them. 



4.2 Setting up Group Projects 

At the outset, students need to know as precisely as possible what it means 
for them to be part of a group project. Obviously, at some point well before a 
decision needs to be made, information must be available outlining the way the 
group project is organised. A typical student handout was given in Chapter 1, 
and it is repeated here for convenience. 

After this handout is distributed, more details of the assessment procedure
are given. The peer assessment in particular intrigues most students. Details of
the assessment are already given for the most part in Chapter 2. However it is 
worth going into particular detail here about the peer assessment. 



4.2.1 Peer Assessment 

When students work as a group, a new dimension enters assessment. If a group 
of students is working towards a common goal, then the final report has a 
single mark and all students who contribute to the report would gain the same 
mark. This is correct and represents collective responsibility for the final report. 

Group Project

Organisation 

A set of students, minimum number 2, maximum number 6, preferred number 3-5, work as a 
team on a project. This is called a Group Project. 

At the beginning of the module, there will be a short sequence of lectures explaining each 
project. At the end of the lectures, students will be expected to choose one of them to work 
on. This does not preclude students from coming up with their own ideas for projects, but 
any ideas must be vetted for suitability by the module leader, and there must be a group 
of students willing to work together on it. In practice each project consists of a number of 
topics that individuals work on. There is a handout accompanying each project that includes 
suggested topics, however these are only suggestions and students working on the project are at 
liberty to formulate their own strategies for completing the project. Again, the module leader 
is there to help and provide guidance on, for example equalising workload within a project 
group. The goal is to produce a project report by the end of the period to which all the group 
has contributed more or less equally, and with which all members of the group are proud to be
associated. 

The module is worth 10 credits (there are 120 credits in the year) so a rough guide is that each 
student should spend 100 hours on their contribution. 

After the initial lectures, formal class contact is restricted to one hour per week when the 
module leader is available in the classroom for general consultancy, making general points from
matters that arise, giving advice, etc. There are two other timetabled hours in the week when 
the module leader is available in his/her office. This should be used for specific queries, usually 
but not necessarily of a technical nature. 

The choice of who does a particular project is largely up to you. You will be asked to state 
preferences, and these will form the basis of the split into groups. Note that it is perfectly 
possible for more than one group to work on the same project. Usually the projects have 
enough scope for many groups. The module leader does have the final say as it may be in your 
own interest to ensure that the composition of each group has roughly equal mathematical 
ability. 

Assessment 

The assessment of this module will be as follows: 

1. Contribution to final report [hand in deadline]      70%
2. Interim Report [hand in deadline]                    10%
3. Verbal report to the whole class [date of orals]     10%
4. Peer assessment by other members of the group        10%



On the other hand, each student’s contribution to the project as a whole is 
distinctive as is the view the rest of the group has on any particular student. 
How to assess this opinion is not easy. Here are a few suggestions, all of which 
I have tried. 

1 One could simply use one’s judgement and give each student a mark. This 
is very subjective and lays the module leader open to criticism. 

2 A modification one might try is to interview each student and ask questions.
Then armed with this information, together with one’s own impressions,
a mark is given to each student. The main problem with this remains 
subjectivity both on the part of the module leader and the students. It is 
open to bias especially if the students “gang up” on one particular student. 

3 A different method is to give the group 100 marks and tell them to distribute
these marks amongst themselves in an agreed fashion. The final mark given
to each student is then this agreed mark scaled with the overall project 
mark. As an example, suppose there are four students with an agreed peer 
group mark distribution 40, 24, 20, 16. Say the overall mark for the group 
project final report was 50%. The students would then get 20, 12, 10 and 
8 which if peer assessment was 10% of the total mark would contribute 
2%, 1.2%, 1% and 0.8% to each student’s mark respectively. I have used 
this method and the problem with it is that students tend to be kind to 
each other and agree a uniform distribution of marks. This was so even 
when it was obvious to everyone that in one particular case one student 
was being “carried” by everyone else. Another difficulty is that at extremes 
the scoring system fails. If a project gets 100% (or close) or 0% (or close) 
the module leader has to depart from the formula and give everyone the 
same excellent or awful mark. 
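
The arithmetic of the third method can be sketched in a few lines. This simply reproduces the worked example above (shares 40, 24, 20, 16 and a report mark of 50%); the function name and the default 10% weighting are illustrative assumptions.

```python
def peer_marks(shares, report_mark, peer_weight=10.0):
    """Scale each student's agreed share by the report mark, then by the peer weighting.

    shares      -- the marks the group agreed on, which must total 100
    report_mark -- overall mark for the final report, as a percentage
    peer_weight -- percentage of the module mark given to peer assessment
    """
    if abs(sum(shares) - 100) > 1e-9:
        raise ValueError("the agreed shares must add up to 100")
    scaled = [share * report_mark / 100 for share in shares]
    contributions = [mark * peer_weight / 100 for mark in scaled]
    return scaled, contributions


if __name__ == "__main__":
    scaled, contributions = peer_marks([40, 24, 20, 16], 50)
    print(scaled)         # the marks of 20, 12, 10 and 8 quoted above
    print(contributions)  # percentage points added to each student's module mark
```

Feeding in a uniform distribution such as 25, 25, 25, 25 shows immediately why students who are kind to each other remove any discrimination from the scheme.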

In these days of quality audit and quality assessment, it is a good idea to have 
a second marker. This might seem impractical in peer assessment, but if one 
interprets this as having someone else oversee the process, then all of the above 
can be “second marked” . 

Most students like some idea of what is expected of them in terms of volume 
as well as content. In Plymouth, the individual project is worth 20 credits and 
should be 50 pages in length. In fact to discourage longer projects, the rule is 
that only the first 50 pages are marked (with some allowance for appendices, 
computer programs and the like). If the group project is 10 credits per student 
and the final report is worth 70% of the assessment then each student can 
expect to contribute about 18 pages ($\tfrac{7}{20} \times 50$, rounded up) and for a group of
five the group project report can be expected to be about 90 pages long. Of 
course this is only a rough guide, but it is useful for students. 
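
The page guide is just a proportion, and a short sketch makes the reasoning explicit: scale the 50 page individual project by the ratio of credits (10 to 20) and then by the 70% weighting of the final report. The function name and its default values are assumptions taken from the Plymouth figures quoted above.

```python
import math
from fractions import Fraction


def pages_per_student(individual_pages=50, individual_credits=20,
                      group_credits=10, report_weight=Fraction(70, 100)):
    """Rough page guide for one student's share of a group project report."""
    share = Fraction(group_credits, individual_credits) * report_weight
    # exact fractions avoid any rounding surprise before the final round-up
    return math.ceil(individual_pages * share)


if __name__ == "__main__":
    per_student = pages_per_student()
    print(per_student, 5 * per_student)  # pages per student, and for a group of five
```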



4.2.2 Dividing into Groups

Another question that concerns most students is how to select their peers. A 
method that seems to work well is to place on the notice board a chart. The 
body of the chart is empty, along the top are all the projects, and down the
left hand side are the students’ names. Each student is then asked to fill in
the row alongside their name as follows. For the project they would most like,
put a one in the column; for the next preference put a two, and so on until all
columns are filled. If there are five projects from which to choose and there is 
a tie in preference, then instead of 1, 2, 3, 4, 5 there might be 1, 2, 2, 4, 4 
or 1, 1, 1, 4, 5 in some order. The module leader then goes away and tries to 
devise a partition of the student body into groups that keeps everyone happy. 
Sometimes (quite frequently in fact) there is a reason other than an academic 
one for students to form particular groups. They may all live in the same flat
or house or hall of residence, or they may all live in the same place but at some 
distance from the university which means they can meet more easily, especially 
out of term time. Perhaps they all share a hobby, maintain similar sleeping 
habits or are simply a group of friends who know they get on and want to work 
together. All these are perfectly good reasons for forming a group. However,
there is an argument for keeping the groups reasonably balanced as regards
mathematical ability. To have all the best (or all the worst!) students in the
same group is less than ideal. I have concluded that it is preferable to have a 
happy group of students, so I let them choose and so far this has worked, with 
one or two exceptions. 
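
For a small class, the search for a partition can be sketched as an exhaustive search. This is only one way of making “keeps everyone happy” concrete, namely minimising the sum of the ranks on the chart subject to a maximum group size; it is an assumption made here for illustration and not a description of what the module leader actually does. The student labels, the ranks and the capacity are all hypothetical.

```python
from itertools import product

def best_allocation(preferences, capacity):
    """Exhaustively search for the allocation with the lowest total rank.

    preferences: dict mapping student name -> list of ranks, one per project
                 (1 = most preferred; ties such as 1, 2, 2, 4, 4 are allowed).
    capacity: maximum number of students allowed on any one project.
    Returns (total rank, dict student -> project index), or None if no
    allocation respects the capacity.
    """
    students = list(preferences)
    n_projects = len(next(iter(preferences.values())))
    best = None
    for choice in product(range(n_projects), repeat=len(students)):
        # Skip allocations that overfill a project.
        if any(choice.count(p) > capacity for p in range(n_projects)):
            continue
        total = sum(preferences[s][p] for s, p in zip(students, choice))
        if best is None or total < best[0]:
            best = (total, dict(zip(students, choice)))
    return best

# Hypothetical chart: four students, three projects, at most two per project.
chart = {
    "Student 1": [1, 2, 3],
    "Student 2": [1, 2, 2],
    "Student 3": [1, 3, 2],
    "Student 4": [2, 1, 3],
}
total, allocation = best_allocation(chart, capacity=2)
print(total, allocation)
```

The number of allocations grows very quickly with class size, so this is for illustration only; it also knows nothing about flats, friendships or sleeping habits.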

So now we have groups of students who know in general terms what is
expected of them; they have attended some preliminary lectures that outlined
the projects available and have decided on a topic. They have chosen a specific
group project and know with whom they will have to work. Let us now see how
they got on with particular projects.



4.3 Estuarial Diffusion 

This particular group project involved five students. At the beginning of the 
semester, all the students were given some general lectures one of which was 
about modelling diffusion. Students were reminded about the technicalities of 
partial differentiation and how to solve differential equations using separation 
of variables (this technique for partial differential equations was new for most 
students). Students were told that they would meet techniques for solving par- 
tial differential equations including the use of Laplace Transforms in a module 
during the year should they choose it. Some information was then given about 
the physical process of diffusion in terms of the behaviour of molecules, together 
with the briefest introduction to modelling turbulence using eddy viscosity. Fi- 
nally the students were reminded about finite difference methods and were told 
diagrammatically how these could be applied to partial differential equations
of diffusion type. A brief mention of the underlying strategy behind the finite
element method followed. Relevant concurrent lecture courses were also indi- 
cated. Following this, this particular group of students decided to do a project 
focusing on estuarial diffusion. This year, another group (of four) decided on 
heat transfer modelling as a topic. Therefore nine students chose to model 
diffusion, and rather than prohibit four from this general area two separate 
project teams were formed looking at these two different aspects of diffusion 
modelling. When one topic is particularly popular, I find that this is the best 
solution. One year, there were four groups on modelling analytical dynamics 
all doing different projects and we will meet one of these later in this chapter. 
It is still a mystery to me as to why one year a given subject is popular with 
students (this year diffusion) whereas another (this year dynamics) is unpopu- 
lar, yet in the following year the reverse is true. It must be some kind of group 
dynamic as all other features such as the lecturer, handouts and mathemati- 
cal experience of the student body are the same. The group project is a new 
and unfamiliar beast as far as mathematics students are concerned, therefore 
no matter how unfathomable their choices might seem, it is important to try 
and cater for them. However, to return to this particular project: a general
handout on “Modelling Diffusion” accompanied the lecture, and this handout
is reproduced here. Having read this handout and absorbed the lecture,
the five students chose the following tasks. Student A researched the history
of diffusion, writing some biographical details of A. Fick. The one dimensional
diffusion equation was derived from first principles using Fick’s assumptions.
This extended and filled in the gaps from the lecture material. Quite a lot of 
physics was involved as well as historical research. This student did well. Stu- 
dent B was a capable mathematician and concentrated on finding analytical 
solutions to the one dimensional diffusion equation. For example, the separation 
of variables solution to the equation

$$\frac{\partial \phi}{\partial t} = \kappa \frac{\partial^2 \phi}{\partial x^2}$$

was found in the form 

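$$\phi(x, t) = \sum_{n=1}^{\infty} A_n e^{-\kappa n^2 \pi^2 t} \sin n\pi x$$

where the boundary conditions imposed are $\phi = 0$ at $x = 0, 1$ and the constants
$A_n$ are fixed by the initial condition. The detail in the
project was painstaking and other boundary conditions were considered, such
as a condition on $\partial\phi/\partial x$ at $x = 1$.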
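
A separation of variables solution of this kind can be checked against a finite difference calculation. The sketch below assumes the unit interval $0 \le x \le 1$ with $\phi = 0$ at both ends, $\kappa = 1$ and the single-mode initial condition $\sin \pi x$; these choices, the function names and the grid sizes are made here for illustration and are not taken from the student's report.

```python
import math

def ftcs_diffusion(kappa, nx, dt, steps):
    """March phi_t = kappa * phi_xx on 0 <= x <= 1 with phi = 0 at both ends.

    Uses the explicit (forward time, centred space) finite difference scheme,
    starting from the single mode phi(x, 0) = sin(pi x).
    """
    dx = 1.0 / nx
    r = kappa * dt / dx**2
    if r > 0.5:
        raise ValueError("explicit scheme is unstable for kappa*dt/dx^2 > 1/2")
    phi = [math.sin(math.pi * i * dx) for i in range(nx + 1)]
    for _ in range(steps):
        new = phi[:]
        for i in range(1, nx):
            new[i] = phi[i] + r * (phi[i + 1] - 2 * phi[i] + phi[i - 1])
        new[0] = new[nx] = 0.0  # boundary conditions
        phi = new
    return phi

def exact_single_mode(kappa, x, t):
    """The n = 1 term of the separation of variables series, with A_1 = 1."""
    return math.exp(-kappa * math.pi**2 * t) * math.sin(math.pi * x)

kappa, nx, dt, steps = 1.0, 20, 0.001, 100   # illustrative values only
phi = ftcs_diffusion(kappa, nx, dt, steps)
t = dt * steps
max_error = max(abs(phi[i] - exact_single_mode(kappa, i / nx, t))
                for i in range(nx + 1))
print(f"largest difference at t = {t}: {max_error:.2e}")
```

The guard on `r` reflects the usual stability restriction on the explicit scheme; a larger time step makes the function refuse to run rather than produce a growing oscillation.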



Perhaps the rest of the group found the mathematical rigour of this section 
a little frightening, but whatever the reason the solutions here were not re-
lated directly to estuaries in the other chapters. Student C examined numerical

Modelling Diffusion



1. Mathematical Nature 

Analytical modelling using differential equations, techniques such as separation of variables, 
using Laplace transforms, specialist techniques such as using similarity variables. References: 
E. Zauderer Partial Differential Equations of Applied Mathematics 1989, H. F. Weinberger A 
First Course in Partial Differential Equations 1965. A specialist book on modelling diffusion 
is J Crank The Mathematics of Diffusion OUP 1975 

Numerical Modelling using finite differences. References: Books on numerical solutions to par- 
tial differential equations e.g. G. D. Smith Numerical Solution of Partial Differential Equa- 
tions OUP 1969. 

Simulation using random walks and writing a computer program. 

2. Possible Strategies 

Start with the one dimensional diffusion equation 

$$\frac{\partial \phi}{\partial t} = \kappa \frac{\partial^2 \phi}{\partial x^2}$$

where $\phi$ may represent temperature, $x$ distance along a bar and $t$ time. Investigate solutions
to this equation using one or more of the methods listed under 1 above.

Look at more general equations: 



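two dimensional

$$\frac{\partial \phi}{\partial t} = \kappa \left( \frac{\partial^2 \phi}{\partial x^2} + \frac{\partial^2 \phi}{\partial y^2} \right)$$

and three dimensional

$$\frac{\partial \phi}{\partial t} = \kappa \left( \frac{\partial^2 \phi}{\partial x^2} + \frac{\partial^2 \phi}{\partial y^2} + \frac{\partial^2 \phi}{\partial z^2} \right)$$

Investigate how the methods of solution generalise. What is $\kappa$ and how is it determined?
Investigate real life applications (e.g. diffusion in estuaries, see R Lewis Dispersion in Estuaries
and Coastal Waters, John Wiley 1997)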

3. Example of a Project Team Plan 
Five members A, B, C, D, and E. 

A Investigates one, two and three dimensional analytical solutions. 

B Investigates numerical solutions. 

C Investigates how $\kappa$ is determined, solutions involving variable $\kappa$ and alternatives.

D Investigates simulation using computing, e.g. particle tracking. 

E Investigates extensions, e.g. non-linear modelling. 



methods of solution. Once more, these were rather idealised situations and not 
directly related to estuaries. This student showed good understanding of finite
difference methods, and explicit, Crank-Nicolson and fully implicit solutions to
the diffusion equation were analysed in terms of truncation error and stability.
Finally this student applied a particle tracking random walk method (copied
from a paper) to simulating pollution in an estuary. The fourth student D
looked at non-linear diffusion, in particular the analytical solution for the
diffusion of a contaminant in a river containing a stream of constant speed $U$.
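Laplace transforms are used, and many pages are covered with mathematics to
derive that the concentration $c(x, t)$ that satisfies the equation

$$\frac{\partial c}{\partial t} + U \frac{\partial c}{\partial x} = \kappa \frac{\partial^2 c}{\partial x^2}$$

with $c(0, t) = c_0$, $c(\infty, t) = 0$ for $t > 0$ and $c(x, 0) = 0$ for $x > 0$ takes the form

$$c(x, t) = \frac{c_0}{2} \left[ \operatorname{erfc}\left( \frac{x - Ut}{2\sqrt{\kappa t}} \right) + e^{Ux/\kappa} \operatorname{erfc}\left( \frac{x + Ut}{2\sqrt{\kappa t}} \right) \right].$$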

Finally, the fifth student E decided to concentrate on the phenomenon called 
the Kelvin-Helmholtz instability. When two immiscible liquids flow over each
other and the difference in the speeds is large enough then waves first appear 
then grow on the interface. In an estuary the two liquids are freshwater flowing 
over saline water, and this instability can be an important mixing mechanism 
which enhances diffusion. This mechanism for diffusion is different from those 
in the rest of the project as it is a form of shear diffusion caused generally where 
there is a sharp variation of current speed with vertical distance. By its very 
nature there was very little connection with the rest of the project; nevertheless 
the mathematics was of high quality. Water waves were analysed using linear 
theory and those aspects relevant to the discussion of the Kelvin-Helmholtz
instability highlighted. The student made use of the classic text Water Waves 
by J. J. Stoker 1957, the style of which is not easily accessible to undergraduates 
these days. 

The outcomes of the theory are discussed in terms of the gradient Richard- 
son number defined by 



$$R_i = -\frac{g}{\rho} \frac{\partial \rho}{\partial z} \bigg/ \left( \frac{\partial u}{\partial z} \right)^2 .$$




The worthiness of this final contribution is not in question, but one wonders if 
the student would not have been happier pursuing an individual project. 

About six weeks into the project, the individuals in this group submitted 
interim reports that showed each of them making reasonable progress. At least 
they all seemed to understand what they had to do which is the important 
thing. The reports were marked by the module leader and feedback was given. 
It is a good idea for a second assessor to get involved too. Perhaps they should 
second mark these reports. This was not done at this stage in Plymouth, but in 
subsequent years it was introduced as part of improvements in quality assurance 
processes. 

The presentation to the whole class (about three months into the project) 
took the form of five mini-presentations. The student who was second up and
tackled the analytical solutions overran, which upset the timing for the three
that followed. The style of presentation was fine if perhaps a little dry and
over-formal (due to nerves probably). The students clearly understood the
mathematics that they presented and were able to answer questions well. Most 
were confident speakers which certainly helped. 

The final report was submitted on time and the group seemed content with
it. Some comments on this project report follow. In this project it is easy to see
from the report that each chapter was written by a different student. There was
no real attempt to link the chapters together apart from a sentence at the end
of one or two of the type “In the next chapter we will discuss ...”. If a serious
attempt has been made to link the subject matter of each chapter then this is 
worth extra marks, but it is quite understandable for each student to do one 
chapter. Time is usually the critical factor, and in a project that only lasts one 
semester refinements such as rewriting chapters in the same style and swapping 
material around are editorial jobs that have a low priority. Another relevant 
point is that in this group, there was no elected leader who might have taken 
on such an editorial role. Had the chapters been written in different typefaces,
or had the equation numbering and paragraph styles differed, this would have
been less acceptable.

The peer assessment for this project was done by interviews. Each student 
was interviewed in isolation by the module leader. It came as no surprise to any 
of the students that all received the same high mark. If the students themselves 
had done the marking, the result would have been no different. 

The feedback from this group of students was very positive and they clearly 
enjoyed and gained a great deal from working on the project. From the module 
leader’s viewpoint on the whole the final written report was acceptable and 
represented the outcome of a successful project. The students certainly seemed 
to work well together. They happened to be all the same sex (female) and 
psychologists tell me that females are more co-operative than males who tend 
to exhibit rivalry in groups. My experience in running group projects agrees 
with this (so far). The module leader and an assessor each marked the written 
project report independently. Both gave it high marks, with the module leader’s 
marks a little but not significantly above the independent assessor’s marks. 
Both were of the opinion that the report represented the work of a group that 
worked well together and had produced a report of high calibre. The marks 
eventually agreed between the assessors were in the first class category and the 
external examiner was also in agreement with this. Two of the group went on 
to get first class degrees. (The other results were one upper second and two
lower seconds.)



4.4 Graphs and Networks

Graphs have an interesting place in mathematics. A few years ago, they would
have been entirely absent from the mathematics curriculum. Even today, a lecture
course on graphs would be seen as unusual; moreover the syllabus would
have to be scrutinised in detail to find out whether it was pure or applied 
mathematics. The reason for the recent upsurge in interest in graphs stems 
from the ability of computers to do large amounts of routine arithmetic. The 
kind of arithmetic required here is particularly suited to computers as it is 
essentially binary. Decisions have to be made whether to go along one path or 
an alternative subject to the satisfying of certain criteria. The study of graphs 
dates back to the prolific Swiss mathematician Leonhard Euler (1707-1783), 
but really only took off following advances in operational research during the 
Second World War. Now, there are whole industries using advanced network- 
ing algorithms particularly linked to the operation of the internet. Students 
therefore look upon an opportunity to work in this field with some keenness. 
It is also a refreshing change from all that calculus-based mathematics. This 
particular project handout was an attempt to woo those more interested in 
pure mathematics so it is that side of the subject that is emphasised. 

This particular year, two students expressed a desire to do a project involv- 
ing graphs and networks. One would have thought that this would have been 
straightforward, however they fell out from the word go although the module 
leader was not aware of the completeness of the breakdown until three weeks 
into the project. So unfortunately, by this time it was impossible for either of 
them to change to another group. The answer to this problem is a pragmatic 
one: allow both of them to continue as a “group” but also allow them to con- 
tinue independently minimising the amount of interaction. This is really only 
possible in a group of two and is particularly easy in a graphs and networks 
group. One student concentrated on undirected graphs, the other student con- 
centrated on networks. In effect, they both undertook individual mini-projects. 
By not working together, the students missed the opportunity to cooperate, to 
interact, to pool resources and partake in general in the benefits of group work. 
However on the positive side, assessment of the contribution of each was made 
easy. The module leader was not happy with this solution, but it was felt to 
be preferable to shoe-horning them together and producing lots of friction and 
misery but very little else. 

The first student was not particularly strong in terms of mathematical abil- 
ity, so (s)he decided to concentrate on defining various types of graphs and 
outlining some practical uses of graphs. First of all features such as a complete 
graph, a subgraph, cycles and paths were defined. The adjacency matrix and
the incidence matrix were introduced as was the handshaking lemma and its

Graphs and Networks

1. Mathematical Nature This project may only involve arithmetic. It can also involve linear al- 
gebra and matrices. Reference material: books on graph theory by Ronald Gould, by F. Harary 
and by Robin J Wilson. A more applications based book is Mathematics in Communication 
Theory by R H Jones and N C Steele published in 1989 by Ellis Horwood. The first part is 
most relevant for this project, the second more use for Fourier Transforms. 

2. Possible Strategies Several different projects are possible based on graphs and networks. 
Here are two which in themselves have very broad scopes. 

(a) Undirected Graphs - look at the properties of trees; binary trees and quad trees and their
relationship to the storage of images on software.

(b) Networks - look at the properties of directed graphs; cutsets, fundamental cycles and apply
ideas and concepts to the analysis of electrical networks.

3. An Example of a Project Plan 

(a) Graph Theory (three people) 

A Study tree searching algorithms 

B Look at computer implementation 

C Look at a practical application such as the storage of images or the parsing of sentences. 

(b) Networks (three people) 

A Examine various network flow algorithms e.g. depth first search, FIFO.

B Investigate more complicated algorithms and their implementation on a computer.

C Look at matrix methods for electrical networks and connections to practical applications.

Other topics that could lead to projects are map colouring and tiling. 

consequences. Here is the handshaking lemma: 

In any graph, the sum of all the vertex degrees is equal to twice the number
of edges. The proof is very straightforward:

Each edge has exactly two ends, and by the definition of degree it therefore
contributes exactly two to the sum of the vertex degrees. The result follows.
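
The lemma is easily checked on any small graph. In the sketch below the graph (a four-cycle with one diagonal) and the function name are invented for illustration.

```python
from collections import Counter

def vertex_degrees(edges):
    """Return the degree of every vertex of an undirected graph given as edge pairs."""
    degrees = Counter()
    for u, v in edges:
        degrees[u] += 1   # one end of the edge
        degrees[v] += 1   # the other end (a loop, u == v, adds two to u)
    return degrees

# Hypothetical example: a 4-cycle a-b-c-d with the diagonal a-c.
edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "a"), ("a", "c")]
degrees = vertex_degrees(edges)
degree_sum = sum(degrees.values())
print(degree_sum, 2 * len(edges))
```

The loop mirrors the proof: each edge is visited once and adds one to the degree at each of its two ends.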

This gives a flavour of the difficulty (or more strictly the lack of difficulty!) 
of the mathematics in the opening sections. The discussions of bipartite graphs
and unions and complements of graphs were a little more involved, but it is not
until applications are addressed that the project really comes alive. This par- 
ticular student found applications in the social science of relationships between 
sets of three people with applications to TV soap operas (!), to engineering in 
the form of the design of waste disposal in a city and the representation of the 
change of energy use by the population of (the same?) city. Finally there was
a solution to a puzzle that was popular in the 1970s called “instant insanity”.
To give a flavour of this part of the project, this solution is given below. 

There are four cubes. The faces of these cubes are either red, blue, yellow 
or green. The object is to stack the four cubes on top of one another such that 
when stacked no two adjacent faces have the same colour. Moreover, looking 
at the four cubes stacked on each other, each vertical face contains each colour 
once and only once. In order to solve this problem by trial and error, even using 
a little common sense only reduces the number of possibilities from 331,776
(which is $24^4$) to 82,944! Hence the name of the puzzle. First of all, each cube is
represented by a graph as shown in Figure 4.1. The graphs are drawn as follows.
A cube has three pairs of opposite faces. Draw an edge between vertices if and 
only if the colours shown (G for green, Y for yellow, etc.) are on opposite faces. 
Thus each cube corresponds to a graph with four vertices and three edges. The 
next step is to combine all these graphs into one. This is shown in Figure 4.2. 
The edges are marked with numbers that tell us from which cube the edge 
arises. What happens now is that we search for cycles that are subgraphs of
this graph. Further, each cycle must contain each of the edge labels 1, 2,
3 and 4 just once, with no repeats, and the cycles must have different edge labels
between equivalent colours. There are only two distinct cycles of this type and
these are given below. This enables us to solve the puzzle as follows: mark each
cube 1, 2, 3 and 4 respectively. Use the first cycle $H_1$ to tell which colours go
to the back and front, and the second cycle $H_2$ to tell which colours go left and
right. So on the first cube Y is in front and B is opposite this whereas G is to 
the right and R to the left. Take the second cube: G is at the front and Y is 
at the back, R is on the right and B on the left. For the third cube B is at the 
front with R to the rear, B is on the right and Y on the left. Finally, on the
fourth cube R is at the front, G at the back, Y is on the right and G is on the
left. The stack so formed solves the puzzle, hopefully with the minimum of
insanity! The decision has been
taken here to give a solution to the puzzle rather than copy verbatim from the 
project report. Much of the style of the report has been retained, but in the 
project report itself the explanation is incomplete and in general lacks clarity. 
It does not mention that the cubes have to be specially designed, otherwise it is 
possible for there to be no solution at all, or many solutions (not that many!). 
It all depends on the character of the combined graph. This student lost some 
marks for the general lack of mathematical depth in the project, but gained 
marks by showing a breadth of application, a readable writing style and good 
presentation. 

Figure 4.1 The four cubes and their corresponding graph.

The second student concentrated on matching and assignment, and described
the Hungarian algorithm for optimising the assignment of tasks to people, the
optimisation being in the sense of minimising costs. This student also
introduced bipartite graphs, and if considered as a joint project with the first
candidate this would be a clear case of overlap. However, they were not penalised for
this in this instance as they had had permission to write two distinct reports.
Fortunately the overlap was not great and it was blindingly obvious that no
copying had taken place as the presentation and notation were so different. Here
is an extract from the project that illustrates the kind of matching problem
that was tackled.




Figure 4.3 The two cyclic subgraphs.

A store has four specialist departments, Electrical, Furniture, Food and
Hardware. Each department needs someone to manage it. There are four man- 
agers, Anne, Barry, Carol and David. Each has varying managerial experience. 
Anne could manage Electrical or Furniture but not the others. Barry has never 
worked in the Furniture department but could manage any of the others. Carol 
has experience managing all departments except Hardware. David could man- 
age Electrical or Hardware. The question is how can the Store Manager best 
assign the management jobs? The answer is to draw a bipartite graph with A, 
B, C and D on the left and E, F, G and H on the right. A line is drawn con- 
necting a left vertex (name) with a right vertex (department) corresponding to 
suitability given the above information. The result is the graph depicted below. 
In this instance, it is quite easy to match by inspection (or trial and error). 
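
Matching “by trial and error” can be mimicked by trying every ordering of the departments. The sketch below uses only the suitability information in the extract; the function and variable names are invented here, and the method is plain enumeration rather than anything taken from the student's report.

```python
from itertools import permutations

# Which departments each manager could run, as stated in the extract.
can_manage = {
    "Anne":  {"Electrical", "Furniture"},
    "Barry": {"Electrical", "Food", "Hardware"},
    "Carol": {"Electrical", "Furniture", "Food"},
    "David": {"Electrical", "Hardware"},
}
departments = ["Electrical", "Furniture", "Food", "Hardware"]

def perfect_matchings(can_manage, departments):
    """List every assignment giving each manager a different, suitable department."""
    managers = list(can_manage)
    matchings = []
    for order in permutations(departments):
        if all(dept in can_manage[m] for m, dept in zip(managers, order)):
            matchings.append(dict(zip(managers, order)))
    return matchings

matchings = perfect_matchings(can_manage, departments)
for matching in matchings:
    print(matching)
print(len(matchings))
```

With four managers there are only 24 orderings to try and four of them are valid matchings. With $n$ managers there are $n!$ orderings, which is why enumeration soon becomes impractical.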




Figure 4.4 The bipartite graph for the matches of manager with department.

For example, one solution is Anne - Electrical, Barry - Food, Carol - Furniture
and David - Hardware. There are three other solutions, but the point
made by the student a little later is that if there were many more vertices 
a solution by hand would be impractical. In more complex problems it is no 
longer a question of merely either being able or unable to do something but 
there will be varying degrees of suitability. Perhaps each manager has a price 
(the example needs to be more complex for the analogy to work ideally). In 
this case a cost matrix can be defined and each edge in the bipartite graph is 
assigned a value corresponding to the cost of the function on the right being 
done by the particular individual on the left. An algorithm is required to find 
a matching that minimises cost. The project then goes on to explain the Hun- 
garian algorithm for doing precisely this. This is quite long and is not repeated 
here: nevertheless the point is that the student seemed to understand it and 
he gave an example. What was lacking was any generalisation to more complex
problems. There was also no theory to speak of. The project read more like a 
manual on how to use the Hungarian algorithm. The presentation was good, 
though not as good as that of the first student. However the level of mathe- 
matics was a little higher. This particular student was undertaking computing 
modules, and his layout and treatment of algorithms betrayed this. Overall his 
mark was similar to that of the first student. 
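
To show what a minimum-cost assignment looks like, here is a brute-force sketch. It is emphatically not the Hungarian algorithm, which reaches the same answer far more efficiently; it simply tries every assignment. The cost matrix is hypothetical and does not come from the project.

```python
from itertools import permutations

def cheapest_assignment(cost):
    """Return (total cost, assignment) minimising the sum of cost[i][assignment[i]].

    cost[i][j] is the cost of person i doing job j. Every permutation is
    tried, so this is only sensible for very small matrices.
    """
    n = len(cost)
    best_total, best_perm = None, None
    for perm in permutations(range(n)):
        total = sum(cost[i][perm[i]] for i in range(n))
        if best_total is None or total < best_total:
            best_total, best_perm = total, perm
    return best_total, best_perm

# Hypothetical cost matrix (rows: people, columns: jobs); not from the project.
cost = [
    [9, 2, 7, 8],
    [6, 4, 3, 7],
    [5, 8, 1, 8],
    [7, 6, 9, 4],
]
total, assignment = cheapest_assignment(cost)
print(total, assignment)
```

Here the cheapest assignment costs 13, with person 0 taking job 1, person 1 job 0, person 2 job 2 and person 3 job 3. Simply giving each person their cheapest job fails because persons 1 and 2 both want job 2, and resolving such conflicts systematically is what an assignment algorithm has to do.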

As a whole, the project was in a sense doomed from the start due to the
entire separation of the two students. There were no marks for co-operation
between candidates; peer review had to be the same (in percentage terms) for
each individual contribution, which in this instance was separately marked. It
was important that the students were not disadvantaged by the circumstances.
However one could argue that their inability to co-operate was a shortcoming
that ought to be reflected in the assessment. The problem with this is the
apportioning of blame. The module leader could not tell whose fault it was that
they did not get on, or indeed if it was anybody’s fault as such. To deduct
marks on hearsay evidence stores up trouble, invites appeals and is simply
immoral. The clash between them could not have been forecast, and since the
entire cohort was small there was no solution other than to let them do two
individual mini projects. In this sense this project (strictly these two projects)
belongs in Chapter 3. The lessons here are: first, beware groups of two. Make
sure they really want to work together before deciding on the groups. Second, 
should this situation arise, be pragmatic and put the needs of the students first 
by changing the rules if necessary. In some respects, make the best of a bad 
job! 

In the next case study, we have a second example of one that did not really
work, but for entirely different reasons.



4.5 Fourier Transforms

Fourier series and Laplace transforms form part of a standard mathematics 
degree course. Fourier series are normally met in year two, either as a stand- 
alone course or as part of a course on differential equations. Laplace transforms 
are either a year two or a year three topic, but sometimes they are absent 
altogether much to the astonishment of electronic engineering students who 
seem to know about them from day one! Fourier transforms are sometimes met 
as a limiting process from Fourier series, but typically this is a single lecture 
and students are not introduced to applications. Fourier transforms are also 
met within a course on integral transforms, but the same comments apply. This 
group project led the student towards practical applications but also hinted at 
more pure mathematical asides. The handout is reproduced below. 



Fourier Transforms 

1. Mathematical Nature 

This is a direct follow on from Fourier series and links to differential equations (ordinary and 
partial), Laplace transforms and eventually to time series analysis and signal processing. 
Reference material is in books, e.g. Fourier Transform and its Applications R Bracewell
(1986) second edition; and more specialist books such as Fast Fourier Transforms E O Brigham 
(1974). Look under 515.723 in the library, there are lots of books; find one to your taste. 

2. Possible Strategies 

Start by revising periodic functions and Fourier series. Look at the limiting process by which 
Fourier transforms are obtained. Note the conditions on the function, they are quite restrictive. 
Go on to look at specialisms: generalised functions (the impulse function and its relatives); 
applications to signal processing - the spectrum of a signal, the Wiener-Khinchine relations,
windowing, smoothing, etc. Then particular applications to medicine, environmental science
or engineering. 

3. An Example of a Project Plan 

Suppose that there are five team members and call them A, B, C, D, and E. 

A Examines generalised functions, their properties and examples of their use. 

B Investigates the theory behind Fourier transforms, the Riemann-Lebesgue lemma, the re- 
lationship to linear spaces. 

C Researches applications to signals, the Wiener-Khinchine relation (which relates a spectrum
to autocorrelation). 

D Investigates windows and smoothing (works closely with C). 

E Investigates practical applications in electronics, medicine or whatever. 



The lecture that accompanied the handout covered the following material. 
First of all there was a brief résumé of orthogonal function theory, placing
Fourier series in context. The Fourier transform was derived as a limiting pro-
cess. The following points were emphasised: the inverse transform is straight-
forward, but the Fourier transforms of such standard functions as the constant
function, polynomials and trigonometric functions do not exist as the improper
integral diverges. The class was then reminded about the Dirac $\delta$ function as
an example of a generalised function. Finally, the lecture ended by giving the 
students a description via diagrams of the basic ideas behind the processing of 
signals. 

Armed with this information, a group of three students decided to take on 
this project. Two of them were friends, but mainly due to both failing so many 
modules in previous years! They were re-sitting the year. The third member was 
a mature student, rather distant from the other two both in temperament and 
work ethic. All three struggled with the finer points of mathematics, although 
the mature student obviously had a physics/engineering background which was 
helpful. He also was superb at presentation which proved to be a source of 
tension between the three students towards the end. Therefore this was always 
going to be an interesting group. Interesting in the sense of the ancient Chinese 
saying (curse?) “May you live in interesting times” that is. The scope of the 
project looked to be fine, and there was a possibility of a good outcome. This 
possibility was unfortunately not realised. 

The mature student took on the role of general manager of the project, and 
although there were only two others to manage, he had his hands full from the 
start. He took on the topic of the Laplace transform and its relation to the 
Fourier transform. After (re)introducing Fourier series he defined the Laplace 
transform through:

$$\mathcal{L}\{f(t)\} = \int_0^{\infty} f(t) e^{-pt}\,dt$$

and derived several properties and elementary results. The impulse function 
and its use were described before the Fourier transform: 

$$F(\omega) = \int_{-\infty}^{\infty} f(t) e^{-i\omega t}\,dt$$

together with its inverse were introduced. He then went on to develop the subtleties required by signal
processing, notably scaling properties, symmetry and shifting. Finally the dis- 
crete version of the Fourier transform (DFT) and the fast Fourier transform 
(FFT) were described and used in a simple example. This student obviously 
understood how the FFT is computed. His contribution ended with the intro- 
duction of windows and smoothing and a discussion of how these are applied 
to signal processing. This was a very impressive if isolated piece of work. The 
second student investigated digital signals, examining the Nyquist frequency, 
filtering, and concepts of bandwidth and noise. He discussed the difference between
amplitude and frequency modulation, and the definition of baud rate.
However his contribution was brief and superficial. In particular one was not
really convinced that he knew what he was talking about! The third student 
concentrated on signal analysis with an account of autocorrelation, spectral 
density and the connection between them, the Wiener-Khinchine relations.
Finally he looked at the application to random signals. Again, the same comment
could be made. This student’s report was altogether unconvincing. 
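
For readers who want to see the discrete transform mentioned above in action, the following is a minimal sketch and not a reconstruction of any student's work. It assumes NumPy is available; the function name `naive_dft`, the sampling rate and the 5 Hz test tone are all illustrative choices. It evaluates the DFT sum directly, compares it with NumPy's FFT, and locates the spectral peak, which sits well below the Nyquist frequency of half the sampling rate.

```python
import numpy as np

def naive_dft(x):
    """Direct O(N^2) evaluation of X_k = sum_n x_n exp(-2*pi*i*k*n/N)."""
    x = np.asarray(x, dtype=complex)
    n = np.arange(len(x))
    k = n.reshape(-1, 1)
    return np.exp(-2j * np.pi * k * n / len(x)) @ x

fs = 64.0                                # sampling rate in Hz (illustrative)
t = np.arange(64) / fs                   # one second of samples
signal = np.sin(2 * np.pi * 5.0 * t)     # 5 Hz tone; Nyquist frequency is fs/2 = 32 Hz

spectrum = naive_dft(signal)
freqs = np.fft.fftfreq(len(signal), d=1.0 / fs)

# The peak appears at +5 Hz and -5 Hz; take the absolute value of the frequency.
peak_hz = abs(freqs[np.argmax(np.abs(spectrum))])
agrees_with_fft = np.allclose(spectrum, np.fft.fft(signal))
```

The FFT gives the same numbers as the direct sum; its advantage is purely one of speed, which is the point a student would be expected to bring out.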

The interim reports indicated that the group did not work together and 
two were too brief. There was some overlap between the work of the students: 
however given the track record of two of the students the module leader was 
pleased to get what he did! 

The presentations were acceptable, but once more they were three sepa- 
rate pieces of work with no evidence of co-operation. The mature student did 
attempt to hold the project together, but with limited success. The work of 
the other two students was short of what is normally expected from final year 
students. One was embarrassingly elementary, consisting of little more than 
school binary arithmetic. 

The final report consisted of three separate pieces of work. In the defence of 
the group, the production was halted due to a catastrophic computer failure, 
but nevertheless the outcome was disappointing. Apart from the contribution 
of the team leader which was substantial, the rest was quite elementary math- 
ematically and too short. There was absolutely no attempt at joining the three 
pieces of work together and the overlaps remained. Clearly there had been very 
little discussion and two of them were very much done at the last minute. The 
module leader suspects that the leader was tearing what little remained of his 
hair out cajoling something out of the other two students before the deadline, 
although in fairness to the group they all insisted each put in equal effort under 
the peer assessment process. This is one project where the unbiased input from 
a dispassionate second assessor would have been particularly useful. Sadly, be- 
cause this project ran in the first student cohort, such quality systems were not 
in place and it was not done. 

The project as a whole was a pass at the lower second level. Both asses- 
sors were aware of the practical difficulties experienced, and allowances were 
made because the catastrophic computer failure stemmed from a virus that 
was picked up on a university computer in the open access area! The leader 
was awarded a first class mark, but the others got lower second class marks. 
However with hindsight the marks awarded were perhaps over-generous. Their 
final degree grades were lower in every case. 

There was quite a lot to learn from this group project. Perhaps in this case 
it was not a good idea for these three individuals to operate as a group. The 
leader was much older and wiser than the others, but unfortunately he also was 
not the best communicator in the world. Although his personal contributions
were excellent in quality, well presented and timely, this tended perhaps to
intimidate already struggling students who felt that they could never match 
this performance. The leader was also not the best at encouraging and bringing 
on his younger colleagues, not least because he was himself finding some of the 
more advanced mathematics difficult. That he did so well was entirely due to 
sheer hard graft not because of innate mathematical ability. He certainly should 
have opted for an individual project. One might say this is indeed what he did! 
The other two got through, just. As their performance in this module was not 
out of line with their performances elsewhere, the outcome here could be said 
to be as expected. 



4.6 Orbital Motion 

It is sad to reflect, speaking as an applied mathematician, that it is quite com- 
mon these days for there not to be much in the way of compulsory mechanics 
in many mathematics degrees. The positive side to this is that there are many 
mechanics based topics which are familiar to the lecturer but outside present 
day syllabuses that can provide rich source material for projects. The specifi- 
cation for a project based on mechanics has therefore to be very broad. The 
decision was made to try and encourage those whose mathematical preferences 
lay more towards pure mathematics to dip their toe into applied mathematics. 
To do this, the emphasis of the handout was on the Lagrangian description of 
mechanics. It is reproduced here. 

To accompany this handout, the lecture began by discussing Newton’s laws 
of motion, particularly the second law. Configuration space was defined and the 
notion of degrees of freedom was explored. The concepts of potential energy 
and kinetic energy were outlined and students were reminded of the conserva- 
tion of energy. The Lagrangian was defined and calculated for simple systems 
such as the pendulum and motion under gravity with which the students were 
(should have been) familiar. The Hamiltonian and general principles such as the 
Principle of Least Action were discussed. The quadratic form was revised, and 
students were shown how the Lagrangian of a linear vibrating system avoided 
many of the difficulties of the traditional method and was amenable to calcula- 
tions involving the matrix algebra associated with eigenvalues and eigenvectors. 
Finally the Poisson bracket and its extension to quantum mechanics was men- 
tioned, though no details were given. Emphasis throughout was on the very 
general application of Lagrangian mechanics. 
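
As an illustration of the kind of calculation covered in the lecture, here is a short worked sketch for the simple pendulum. The notation is assumed rather than taken from the handout: $m$ is the mass of the bob, $l$ the length of the string, $g$ the acceleration due to gravity and $\theta$ the angle from the downward vertical.

```latex
% Kinetic and potential energy (potential measured from the pivot)
T = \tfrac{1}{2} m l^2 \dot{\theta}^2, \qquad V = -m g l \cos\theta

% Lagrangian
L = T - V = \tfrac{1}{2} m l^2 \dot{\theta}^2 + m g l \cos\theta

% Lagrange's equation for the single co-ordinate \theta
\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{\theta}}\right)
  - \frac{\partial L}{\partial \theta}
  = m l^2 \ddot{\theta} + m g l \sin\theta = 0
\quad\Longrightarrow\quad
\ddot{\theta} + \frac{g}{l}\sin\theta = 0
```

The same equation follows from Newton's second law, but the Lagrangian route needs no discussion of the tension in the string.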

Analytical Dynamics

1. Mathematical Nature

This project has the potential to be entirely analytical and can be viewed as an application of
linear algebra (matrices, eigenvalues etc.)

Reference material is Classical Mechanics by T W B Kibble and F H Berkshire (1996) (4th
Edition), Introduction to Analytical Dynamics by N M J Woodhouse (1987) and Classical
Mechanics by H Goldstein (1980) (2nd Edition). The code 531 in the library should reveal
many others and it is worth a browse either on foot or electronically.

2. Possible Strategies

Start from the concept of a dynamical system whereby a system is described by the positions
and velocities of its constituent particles. This is a function of n variables. The general field of
analytical dynamics is about determining equations obeyed by these functions. It can be quite
“pure mathematical” and only relate to Newtonian mechanics when specific applications are
tackled. Historically it arises from the work of Joseph Louis Lagrange in the late 18th century.
French mathematics around the time of the revolution was very active, very good and by and
large pure mathematics.

3. An example of a project plan

Five team members A, B, C, D and E.

A Researches canonical variables and Lagrange’s Equations
B Researches Hamilton’s Equations and the Principle of Least Action
C Researches modern dynamical systems in the context of analytical dynamics
D Relates analytical dynamics to Lie Algebras, Astronomy, Relativity or Quantum Theory
E Solves some simple problems using Lagrange’s Equations.

Virtually the entire class was seduced by this description, and in order
to avoid disappointment, four parallel projects involving mechanics ran in this
particular year. Obviously they had to be quite distinct. One was on the analysis
of a spinning top and associated gyroscope. One was on deriving Hamilton’s 
equations and how these are extended to include relativistic effects. Two of 
them did not in fact involve very much in the way of analytical dynamics 
but were traditional mechanics. One involved the application of dynamics to 
astronomy, and only touched on a Lagrangian description near the very end, 
and the fourth project focused on orbits and their stability. This last project 
is the one we shall describe in more detail here. The fact that it deviated very 
much from the handout is not a problem. The object of the handout is to 
give the student or group of students ideas, and if they drift away from the 
given description all is well provided the lecturer approves the final plan. In 
past times, a project based on orbits would have overlapped considerably with 
second year, perhaps even first year, mechanics. This is not so obviously the case 
these days, but there will be some traditional mathematics degrees in which 
the overlap will still be there. In Plymouth there is a little on orbital motion 
in the second stage of the degree, but not a significant amount. Certainly not 
enough to trouble those concerned with projects duplicating taught material. Of 
course, even if the course has a traditional mechanics based module with much 
on orbits, the possibility of pursuing projects based on orbits is still there but
has to reflect the more advanced student knowledge. For example, something on
the stability of satellites perhaps involving numerical techniques would always
be a possible project. Another option would be to look at relativistic effects,
exploring the motion of Mercury.

The students who tackled this particular
project had a little previous knowledge, but decided on a project scope that
covered a broad range within the general area of orbits, although there was an
attempt at an analysis of the stability of orbits. The three students (all female
as it happens) really did work together, each writing at least one chapter and
collaborating on the others. The interim reports were all very well done, which was
not surprising given their academic records. The verbal presentation is best 
described as courageous. It could have been brilliant, but unfortunately they 
lacked the bravado and it was not entirely the right vehicle. What they did 
was to present a mock interview where one student questioned one other whilst 
the third used the overhead projector. The “expert” was by common consent 
the most able academically and obviously understood the subject thoroughly, 
but the style of the presentation was not geared to bring out the knowledge of 
the other two. The questions could have been better thought out. The advice 
of the lecturer was that this was a novel idea that did not really succeed. It 
would have been more at home in an Open University summer school where 
the emphasis is perhaps more on enjoyment and group participation! 

The final project report was well done. It opened with a chapter on history 
and astronomy which was scholarly. This finished with Kepler and his laws. In 
chapter 2 the polar co-ordinate system was introduced and the acceleration 

$$ \left( \ddot{r} - r\dot{\theta}^2,\; \frac{1}{r}\frac{d}{dt}\left(r^2\dot{\theta}\right) \right) $$

derived. Then followed two chapters on the geometry of conics including de- 
tailed descriptions of the ellipse. Then Kepler’s three laws were proved. This 
included the proof that orbits were ellipses using reciprocal polar co-ordinates. 
There were some unfortunate layout problems at this point. The chapter head- 
ings became inconsistent, the page numbering disappeared and the numbering 
of equations was confused. This was a consequence of lack of time and some 
printer problems towards the end of the project. The final chapter concerned 
the stability of orbits and took the project to another level of difficulty. Meth- 
ods of expansion were used and the motion of the disturbed body compared 
to the simple harmonic oscillator in order to find periods. In the project, the
(radial) equation of motion of an orbit was expanded using r = x + a, and the
equation

$$ m\ddot{x} - \left[\frac{3}{a} f(a) + f'(a)\right] x = 0 $$

analysed for stability using a general power law. Overall the students were a lit- 
tle disappointed with the final project, mainly due to the production problems 
leading to the style and layout glitches. The assessors read the project and were 
in general pleased with its mathematical content. However, the mathematical 
standard of the work on the stability of orbits was some way above the rest. 
Care had to be taken not to use this as a negative influence, condemning the 
rest as not worthy of a similar grade. The problems with the layout were enough 
to bring the overall mark down from first to upper second. The message here,
given time and time again, and ignored by students everywhere, is not to leave
things to the last minute. In their finals, one student got a first and the other
two were awarded upper seconds, and in every case the group project mark was their
worst, so maybe their disappointment was in some way justified!
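
The stability analysis described above can be sketched as follows. This is a reconstruction under stated assumptions and not the students' own working: $f(r)$ is taken to be the radial component of the central force (so it is negative for an attractive force), $h$ is the constant angular momentum per unit mass, $a$ is the radius of the undisturbed circular orbit, and the power law $f(r) = -k r^n$ with $k > 0$ is an illustrative choice.

```latex
% Radial equation of motion, using r^2 \dot{\theta} = h
m\ddot{r} - \frac{m h^2}{r^3} = f(r)

% Circular orbit r = a (\ddot{r} = 0):
f(a) = -\frac{m h^2}{a^3}

% Perturb: r = a + x with |x| \ll a, keeping only linear terms
\frac{1}{(a+x)^3} \approx \frac{1}{a^3}\left(1 - \frac{3x}{a}\right),
\qquad f(a+x) \approx f(a) + x f'(a)

% Substituting and using the circular orbit condition
m\ddot{x} - \left[\frac{3}{a} f(a) + f'(a)\right] x = 0

% Power law f(r) = -k r^n
m\ddot{x} + (n+3)\, k\, a^{\,n-1}\, x = 0
```

The disturbed motion is therefore simple harmonic, and the circular orbit stable, when $n > -3$, with period $2\pi\sqrt{m / ((n+3) k a^{n-1})}$. For the inverse square law ($n = -2$) this period equals the orbital period, which is consistent with closed elliptical orbits.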




4.7 Conclusion 

This chapter is an account of the experience of the author in running group 
projects in mathematics. In some senses, a group project in mathematics is 
much the same as a group project in another discipline, say engineering. How- 
ever there are distinctive features. In engineering, the partitioning of tasks 
needs to be much more carefully done and the whole more managed. There is 
usually something to be made involving technicians who have other things to 
do and their own agenda, and there is usually an end product that should be 
tested. The engineering group project is more multi-disciplinary and aspects 
such as health and safety, and economics (cost) rear their ugly heads. The 
author was, in a previous existence, responsible for inter-disciplinary group 
projects in engineering so these are words written from experience. What the 
mathematics group project gives is the opportunity for the student to work with 
fellow mathematics students. This helps them to be aware of the strengths and 
weaknesses in others and themselves, to tolerate both and to work towards a 
common goal. It also gets the students introduced to managerial skills such as 
conducting themselves in meetings perhaps even chairing them, managing their 
time, keeping to deadlines set by themselves, and listening to others. With few 
exceptions, students who have participated in a group project have found it a 
very worthwhile experience. 

In the next chapter, case studies are explored. These are in a sense a halfway
house between a group project and a standard lecture course. We finish
this chapter with a section on further suggestions for group project topics.



4.8 Further Suggestions 



Normally, at this stage there would be a number of exercises on the contents of 
the chapter, but as in the last chapter that does not work here. Instead there 
follow a few suggestions for group projects. Some of them have been tried, but 
some await trial. 

1. Financial Mathematics 



With the Thatcher era of the 1980s, there was a surge in interest in business
and finance. In particular, a number of research mathematicians became
interested in, and successful at, applying mathematical modelling to the
behaviour of the stock market. A number of these successes were
attributed to the Black-Scholes equation:



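$$ \frac{\partial f}{\partial t} + rS\frac{\partial f}{\partial S} + \frac{1}{2}\sigma^2 S^2 \frac{\partial^2 f}{\partial S^2} = rf $$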



which describes the behaviour of the price $f$ given information about stocks
$S$. In the rest of the notation, $r$ is the interest rate and $\sigma$ is a measure of
the volatility of the stock and is related to the standard deviation of the 
proportional change in stock price in an increment of time. There is a great 
deal of scope here for group projects. All the students need to have an in- 
terest in and understand finance and investment. A group of five can then 
look at the following tasks. One compares and contrasts discrete and con- 
tinuous models of finance. This student could also look at the history of 
financial modelling. One student could investigate Ito calculus which is a 
calculus developed by Kiyosi Ito (1915- ) to describe stochastic variation. 
He developed it for biological and physical systems, but it is well suited 
to model finance. This student could use Ito calculus to derive the Black- 
Scholes equation, but this does require a background in probability and the 
analysis of time series. A more direct approach based on seven assumptions 
is probably a better alternative. The third student could solve this differen- 
tial equation in a variety of cases. It transforms into the standard diffusion 
equation, but the boundary conditions are troublesome. Laplace transforms 
can also be used, but inversion is hard and the solution usually only ob- 
tained in terms of an integral. The fourth student can examine numerical 
methods of solving the Black-Scholes equation. This can involve setting 
up a semi-implicit Crank-Nicolson scheme and writing a program either in
FORTRAN or a language such as MAPLE. Finally the fifth student should
look at the shortcomings of the Black-Scholes model and investigate alternative
models. It is certainly true that the Black-Scholes model (sometimes
called the Black-Scholes-Merton model) does not seem to work now and
is in need of modification. This is a clear case of the application of a model
influencing the behaviour of the system itself. Perhaps students could think
along the lines of Heisenberg’s uncertainty principle!
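
To give a flavour of what the fourth student's program might look like, here is a minimal Crank-Nicolson sketch in Python rather than FORTRAN or MAPLE. It is written under several assumptions that are not in the text above: the option is a European call with strike `K` and payoff max(S - K, 0), the equation is stepped in the time to expiry $\tau = T - t$, the grid is uniform with $S_i = i\,\Delta S$, and the boundary values are $f = 0$ at $S = 0$ and $f = S_{max} - K e^{-r\tau}$ at the outer edge. The function names are hypothetical and NumPy's dense solver is used for brevity where a banded solver would be the efficient choice.

```python
import math
import numpy as np

def bs_call_exact(S, K, r, sigma, T):
    """Closed-form Black-Scholes price of a European call, used as a check."""
    d1 = (math.log(S / K) + (r + 0.5 * sigma**2) * T) / (sigma * math.sqrt(T))
    d2 = d1 - sigma * math.sqrt(T)
    N = lambda x: 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))
    return S * N(d1) - K * math.exp(-r * T) * N(d2)

def bs_call_crank_nicolson(K, r, sigma, T, S_max, M=200, N=200):
    """Crank-Nicolson solution on S_i = i*dS, i = 0..M, with N time steps."""
    dt = T / N
    i = np.arange(1, M)                          # interior node indices
    # Coefficients of f_{i-1}, f_i, f_{i+1} in the spatial operator
    alpha = 0.5 * sigma**2 * i**2 - 0.5 * r * i
    beta = -sigma**2 * i**2 - r
    gamma = 0.5 * sigma**2 * i**2 + 0.5 * r * i
    L = np.diag(beta) + np.diag(alpha[1:], -1) + np.diag(gamma[:-1], 1)
    I = np.eye(M - 1)
    A = I - 0.5 * dt * L                         # implicit half
    B = I + 0.5 * dt * L                         # explicit half
    S = np.linspace(0.0, S_max, M + 1)
    f = np.maximum(S - K, 0.0)                   # payoff at expiry (tau = 0)
    for n in range(N):
        upper_old = S_max - K * math.exp(-r * n * dt)
        upper_new = S_max - K * math.exp(-r * (n + 1) * dt)
        rhs = B @ f[1:-1]
        # Known outer boundary values enter the last interior equation
        rhs[-1] += 0.5 * dt * gamma[-1] * (upper_old + upper_new)
        f[1:-1] = np.linalg.solve(A, rhs)
        f[0], f[-1] = 0.0, upper_new
    return S, f

S_grid, f_grid = bs_call_crank_nicolson(K=100.0, r=0.05, sigma=0.2, T=1.0,
                                        S_max=300.0, M=300, N=200)
numerical = f_grid[100]                          # grid node at S = 100
exact = bs_call_exact(100.0, 100.0, 0.05, 0.2, 1.0)
```

Comparing `numerical` with `exact` for a range of grid sizes is a natural first experiment, and leads directly to a discussion of truncation error and of the effect of the kink in the payoff at the strike price.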

2. Normal Modes 

The whole concept of normal modes or natural frequencies/resonance 
makes a good topic on which to centre a group project. Presumably, most 
students would have met the concept of resonance when discussing the so- 
lution of the equation for simple harmonic motion. Normal modes might 
have been met in a particle mechanics course or more theoretically in a lin- 
ear algebra module in connection with the application of eigenvalues and 
eigenvectors. So the idea that a linear system possesses natural frequencies 
that can be excited by forcing at these frequencies should not be entirely 
new. The introductory talk can be adjusted to cater for the particular needs 
of the student body and to get them up to an appropriate level. Let us look 
at some student tasks that can be assigned within this general heading. 

One sub-project could be to examine the solution of the Helmholtz wave
equation

$$ (\nabla^2 + \lambda^2)\phi = 0 $$

where $\lambda = \omega/c$, $\omega$ = frequency and $c$ = celerity (wave speed). If this is
solved in one dimension then there is one natural frequency $\lambda c$ which is
the single “normal mode”. If solved in two or three dimensions under a
cartesian co-ordinate system, then there are nodal lines or planes corre- 
sponding to a standing wave solution when the whole vibrates in a finite 
domain. From a mathematical point of view the solution to the Helmholtz
wave equation in cylindrical geometry is of interest as it yields Bessel’s dif- 
ferential equation. In spherical geometry a variables separable assumption 
results in Legendre’s equation. In both of these cases, there are a wealth 
of results that can be derived and which take the student through some 
of the properties of these special functions. Particularly important are the 
orthogonality relations which in turn lead to Fourier-Bessel series in the 
one case and spherical harmonics in the other. The graphical capability 
of the computer can be used to good effect to visualise these oscillations, 
and if practical applications are desired there are plenty. The vibration of 
a membrane, the propagation of electromagnetic waves, and geophysical 
oscillations are all worth exploring. 
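
The step from the Helmholtz equation to Bessel's equation mentioned above can be sketched as follows. The notation is assumed for illustration: plane polar co-ordinates $(r, \theta)$, a separable solution with integer $n$, and a circular membrane of radius $b$ held fixed at its edge.

```latex
% Helmholtz equation in plane polar co-ordinates
\frac{\partial^2 \phi}{\partial r^2} + \frac{1}{r}\frac{\partial \phi}{\partial r}
  + \frac{1}{r^2}\frac{\partial^2 \phi}{\partial \theta^2} + \lambda^2 \phi = 0

% Separable form \phi = R(r)\, e^{i n \theta}
r^2 R'' + r R' + (\lambda^2 r^2 - n^2) R = 0

% With \rho = \lambda r this is Bessel's equation of order n
\rho^2 \frac{d^2 R}{d\rho^2} + \rho \frac{dR}{d\rho} + (\rho^2 - n^2) R = 0

% Solution regular at the origin, and the fixed edge condition
R = J_n(\lambda r), \qquad J_n(\lambda b) = 0
```

The zeros of $J_n$ thus fix the allowed values of $\lambda$, and hence the natural frequencies $\omega = \lambda c$ of the membrane.
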

On a different tack there is much mileage in exploring the difference between
linear and non-linear systems. Linear systems vibrate at natural
frequencies that are independent of the initial conditions that get them 
going. On the other hand non-linear vibrations oscillate at frequencies that 
are dictated by initial conditions. As an example, one could use van der 
Pol’s equation: 

$$ \ddot{x} + \epsilon (x^2 - 1)\dot{x} + x = 0 $$

where $\epsilon$ is a constant (which could be assigned a fixed numerical value) and explore
the concept of a limit cycle. Again, one must be led here by the taught
modules experienced by the student to avoid duplication.
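
A student exploring the limit cycle numerically might start from something like the sketch below. It is only an illustration: the value 0.5 for the constant, the step length, the run length and the function names are all assumptions, and the classical fourth order Runge-Kutta method is used simply because it is likely to be familiar. The equation is written as a first order system in $x$ and $v = \dot{x}$.

```python
import numpy as np

def van_der_pol(state, eps):
    """Right-hand side of x'' + eps*(x^2 - 1)*x' + x = 0 as a first order system."""
    x, v = state
    return np.array([v, -eps * (x * x - 1.0) * v - x])

def rk4_path(f, state, dt, steps, eps):
    """Integrate with the classical fourth order Runge-Kutta method."""
    path = np.empty((steps + 1, 2))
    path[0] = state
    for n in range(steps):
        k1 = f(state, eps)
        k2 = f(state + 0.5 * dt * k1, eps)
        k3 = f(state + 0.5 * dt * k2, eps)
        k4 = f(state + dt * k3, eps)
        state = state + dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        path[n + 1] = state
    return path

def late_amplitude(x0, eps=0.5, dt=0.01, steps=20000):
    """Largest |x| over the final 20 time units, after transients have died away."""
    path = rk4_path(van_der_pol, np.array([x0, 0.0]), dt, steps, eps)
    return np.max(np.abs(path[-2000:, 0]))

amplitude_from_inside = late_amplitude(0.1)   # starts near the origin
amplitude_from_outside = late_amplitude(4.0)  # starts well outside the cycle
```

Both starting points settle onto an oscillation of amplitude close to 2, which is the signature of a limit cycle: unlike a linear oscillator, the long-term amplitude does not remember the initial conditions.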

Another avenue to explore might be the connection between normal modes 
and Green’s functions. If Green’s functions are nowhere to be seen on 
the course, then an elementary treatment might be expected whereby the 
Green’s function of a system is defined as its response to an impulse, i.e. the 
system’s response to being struck. More advanced approaches will involve 
two or three dimensions and solutions in various domain shapes. 

Finally there is the role of damping in reducing the amplitude of oscillations 
at natural frequencies or normal modes. The student would have already 
met mass-spring-damper systems, in which case more complex systems 
need to be studied. This subject could involve examination of seismograph 
records to determine damping coefficients. 

3. Water Waves 



Some mathematics courses have modules devoted to waves in general or to 
specific kinds of waves such as waves on water, electromagnetic waves or 
waves on membranes. This possibility is mentioned in connection with the 
case study on traffic waves in the next chapter, but only a few specialist 
mathematics courses will have anything except one dimensional waves on 
strings. Water waves will then provide an opportunity to link the develop- 
ment of wave theory with fluid mechanics. 



One student will certainly have to concentrate on linearising the fluid me- 
chanics, including the dynamic boundary condition (pressure is a constant 
on the free surface) and deriving that one does indeed need to solve the 
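wave equation:

$$ \nabla^2 \phi = \frac{1}{c^2} \frac{\partial^2 \phi}{\partial t^2} $$

even if it is restricted to one dimension:

$$ \frac{\partial^2 \phi}{\partial x^2} = \frac{1}{c^2} \frac{\partial^2 \phi}{\partial t^2}. $$

In addition, there could be an investigation of the perturbation technique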
for solving the non-linear problem that has been used so successfully over 
the years by Civil Engineers to design codes of practice. 

This leads to a second exploration that moves squarely into the realm of 
Civil Engineering, that is the calculation of wave forces. The use of various 
drag laws, the approximations and assumptions made are all important 
aspects for the student to understand. Wave forces on fixed and floating 
structures, how such forces vary with depth as well as the impact of waves 
on the shore are all possible topics here. The use of design loads and the 
use of engineering safety factors gives a student mathematician useful in- 
sight into the real world of the engineer. It is also possible to extend this 
investigation into the calculation of wave forces due to a spectrum of waves, 
although a great deal of guidance is usually required. Engineers do tend to 
oversimplify such problems. 

Another student could examine wave ray theory and its role in deriving 
results in water waves. Non-linearity and caustics can be investigated here 
too. On a more practical level, using wave ray theory to investigate the 
approach of a wave train to a beach or in the design of harbours and marinas 
also leads to fruitful mathematics. Depending again on their background, 
the student could tie this up with the theory of optics. 

Finally, there is the application of water wave theory to open channel flow 
and the whole area of river hydraulics. This is a vast topic and usually 
constitutes a large part of hydraulics in a Civil Engineering degree. Useful 
areas for a mathematics student to concentrate on are the role of waves in 
moving sediment, the interaction between waves and river flow (bringing in 
the Froude Number and stability criteria) and finite and variable depth. Al- 
lowing the depth to vary in some idealised way can lead to special functions 
and other interesting mathematics. 

4. Numerical Solutions to Differential Equations 

These days, all mathematics courses introduce students to the numerical 
solution of ordinary differential equations. This usually takes place in the 
second year where the syllabus covers Euler’s method, modifications thereof 
and Runge-Kutta methods. Formal examination questions on this topic are 
limited, so there is usually a coursework involving exploring the solution 
to some problem which calls (using FORTRAN or MAPLE perhaps) a 
routine for solving ordinary differential equations. It is usual for the student 
to have written a simple program to solve a boundary value problem in 
one spatial variable, but for complex situations commercial software (NAG
routines) is called. Again, different universities will have varying amounts
of this type of material, so the initial introductory talk has to start from
a carefully set level. In the unlikely event of there being no numerical 
solutions to ODEs anywhere on the course, then all students can start 
with the same clean sheet. Here, let us assume all students are familiar 
with Euler’s method and Runge-Kutta methods. 

One student could explore the different truncation errors of the different 
methods. Using Euler’s method as a reference, the second and perhaps 
fourth order Runge-Kutta methods could have been covered in class. In 
which case the student needs to calculate the error of more sophisticated 
methods such as the multi-step Adams-Bashforth and Milne methods. 
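
The comparison of truncation errors can be made concrete with a small numerical experiment such as the sketch below. The test problem $y' = -y$, $y(0) = 1$ on $0 \le t \le 1$, the function names and the step counts are all illustrative assumptions. The observed order is estimated by halving the step length and taking the base 2 logarithm of the ratio of the errors. The two-step Adams-Bashforth method is started with a single Runge-Kutta step.

```python
import math

def euler(f, y0, t_end, n):
    """Euler's method with n equal steps."""
    h = t_end / n
    t, y = 0.0, y0
    for _ in range(n):
        y += h * f(t, y)
        t += h
    return y

def rk4_step(f, t, y, h):
    """One step of the classical fourth order Runge-Kutta method."""
    k1 = f(t, y)
    k2 = f(t + 0.5 * h, y + 0.5 * h * k1)
    k3 = f(t + 0.5 * h, y + 0.5 * h * k2)
    k4 = f(t + h, y + h * k3)
    return y + h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0

def rk4(f, y0, t_end, n):
    h = t_end / n
    t, y = 0.0, y0
    for _ in range(n):
        y = rk4_step(f, t, y, h)
        t += h
    return y

def adams_bashforth2(f, y0, t_end, n):
    """Two-step Adams-Bashforth; the first step is supplied by RK4."""
    h = t_end / n
    t, y = 0.0, y0
    f_prev = f(t, y)
    y = rk4_step(f, t, y, h)
    t += h
    for _ in range(n - 1):
        f_curr = f(t, y)
        y += h * (1.5 * f_curr - 0.5 * f_prev)
        f_prev = f_curr
        t += h
    return y

def observed_order(method, n=40):
    """Estimate the order of accuracy from the errors with n and 2n steps."""
    f = lambda t, y: -y
    exact = math.exp(-1.0)
    err_h = abs(method(f, 1.0, 1.0, n) - exact)
    err_half = abs(method(f, 1.0, 1.0, 2 * n) - exact)
    return math.log2(err_h / err_half)

orders = {m.__name__: observed_order(m) for m in (euler, adams_bashforth2, rk4)}
```

The estimates should lie close to 1, 2 and 4 respectively. A student could extend the experiment to higher order Adams-Bashforth formulae or to Milne's method and compare the observed orders with the theoretical truncation errors.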

Another possibility is to see how the numerical methods developed for 
ODEs generalise to certain types of PDE. The first step is to tackle second 
order ODEs if this has not yet been done. After this, there could be a 
need to restrict attention to explicit methods in time or elliptic equations 
which do not involve time. The calculation of errors is not all that different 
between ODEs and PDEs, but there is plenty of scope for projects to 
explore different methods of computation. There is also the opportunity for 
the student to produce visual aids for understanding the methods of solving 
two dimensional or even three dimensional problems. If the emphasis is on 
studying different methods it might be best to restrict the study to one 
equation, say Laplace’s equation 

$$ \nabla^2 \phi = 0. $$
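
As one example of a method that could be studied, here is a minimal sketch of Jacobi iteration for Laplace's equation on the unit square. The choices made are assumptions for illustration only: the potential is zero on three sides and equal to $\sin \pi x$ on the side $y = 1$, the grid has 21 points in each direction, and the function name and tolerance are hypothetical. This boundary data is chosen because the exact solution, $\sin(\pi x)\sinh(\pi y)/\sinh\pi$, is available for comparison.

```python
import numpy as np

def solve_laplace_jacobi(n=21, tol=1e-8, max_iter=20000):
    """Jacobi iteration for Laplace's equation on an n-by-n grid of the unit square.

    phi[j, i] approximates the solution at x = i*h, y = j*h with h = 1/(n-1).
    """
    x = np.linspace(0.0, 1.0, n)
    phi = np.zeros((n, n))
    phi[-1, :] = np.sin(np.pi * x)           # boundary values on y = 1
    for iteration in range(1, max_iter + 1):
        new = phi.copy()
        # Each interior value becomes the average of its four neighbours
        new[1:-1, 1:-1] = 0.25 * (phi[2:, 1:-1] + phi[:-2, 1:-1]
                                  + phi[1:-1, 2:] + phi[1:-1, :-2])
        if np.max(np.abs(new - phi)) < tol:
            return new, iteration
        phi = new
    return phi, max_iter

phi, iterations = solve_laplace_jacobi()

# Compare with the exact solution on the same grid
x = np.linspace(0.0, 1.0, 21)
X, Y = np.meshgrid(x, x)
exact = np.sin(np.pi * X) * np.sinh(np.pi * Y) / np.sinh(np.pi)
max_error = np.max(np.abs(phi - exact))
```

Replacing the Jacobi sweep by Gauss-Seidel or successive over-relaxation and counting the iterations needed is exactly the kind of comparison of methods suggested above.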

On the other hand, the student could tackle a real problem and solve it 
numerically. A laboratory set-up such as a floating cylinder is reasonably 
straightforward: a simplified storm surge prediction based on shallow water 
of constant depth is harder, but not impossible. 

Given that numerical methods for solving ODEs and PDEs usually end up 
with having to invert large matrices which may or may not be sparse, a 
student could look at the problem of large matrix inversion by numerical 
means. This could include the development of sparse matrix solvers, use of 
complex arithmetic and the development of user friendly software. 
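
The simplest sparse system a student is likely to meet is tridiagonal, as arises from a three-point finite difference formula in one space dimension. The sketch below shows the standard elimination procedure for such systems (often called the Thomas algorithm). It is an illustration only: the function name and argument layout are assumptions, and no pivoting is done, so it relies on the matrix being well behaved, for example diagonally dominant.

```python
import numpy as np

def solve_tridiagonal(lower, diag, upper, rhs):
    """Solve a tridiagonal system by forward elimination and back substitution.

    lower[i-1] is the entry A[i, i-1], diag[i] is A[i, i], upper[i] is A[i, i+1].
    """
    n = len(diag)
    c = np.zeros(n - 1)                      # modified upper diagonal
    d = np.zeros(n)                          # modified right-hand side
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i - 1] * c[i - 1]
        if i < n - 1:
            c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i - 1] * d[i - 1]) / denom
    x = np.zeros(n)
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x

# Example: the matrix with 2 on the diagonal and -1 either side
n = 50
lower = -np.ones(n - 1)
diag = 2.0 * np.ones(n)
upper = -np.ones(n - 1)
rhs = np.arange(1.0, n + 1)
solution = solve_tridiagonal(lower, diag, upper, rhs)
```

The work grows only linearly with the number of unknowns, whereas a general dense solver grows as the cube, which makes the case for exploiting sparsity.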

5. Tops and Gyroscopes 

As was mentioned in Section 4.6, there is often very little compulsory me- 
chanics in a mathematics degree course these days. Often the amount cov- 
ered only equates to what used to be in ‘A’ levels thirty odd years ago. 
Therefore a third year mathematics student typically has sparse knowledge
of Newton’s laws as applied to particles, including perhaps orbital
motion (but see Section 4.6). He or she also knows about moments of inertia
and can calculate them as examples of multi-variable calculus but
typically cannot analyse rolling bodies. Given the restricted nature of the 
topic, this is only really suitable for a group of five students if there is very 
little mechanics in the rest of the course. Let us assume all students know 
about particle mechanics, and understand the concept of moment of inertia 
but little else. 

The first important step to take is for all the students to be familiar with 
Euler’s dynamical equations. This is an instance where the entire group 
needs to get to grips with a new piece of work and where they need to elect 
a member who can write it up and convey the result to the rest of the group 
in an understandable way. The first student undertakes to derive Euler’s 
equations and perhaps goes on to solve these equations in the special case 
of a spinning axisymmetric top. Precession is important, nutation less so. 
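
For reference, Euler's dynamical equations and the simplest special case can be written down as follows. The notation is assumed: $I_1, I_2, I_3$ are the principal moments of inertia, $\omega_1, \omega_2, \omega_3$ the components of angular velocity along the principal axes, and $N_1, N_2, N_3$ the components of the applied torque. The special case shown is the torque-free axisymmetric body, which is easier than the top spinning under gravity and is offered only as a first step.

```latex
% Euler's dynamical equations
I_1 \dot{\omega}_1 - (I_2 - I_3)\,\omega_2 \omega_3 = N_1
I_2 \dot{\omega}_2 - (I_3 - I_1)\,\omega_3 \omega_1 = N_2
I_3 \dot{\omega}_3 - (I_1 - I_2)\,\omega_1 \omega_2 = N_3

% Torque-free axisymmetric case: I_1 = I_2, N_1 = N_2 = N_3 = 0
\dot{\omega}_3 = 0, \qquad
\dot{\omega}_1 = -\Omega\, \omega_2, \qquad
\dot{\omega}_2 = \Omega\, \omega_1, \qquad
\Omega = \frac{I_3 - I_1}{I_1}\, \omega_3

% Hence
\omega_1 = A \cos(\Omega t + \alpha), \qquad \omega_2 = A \sin(\Omega t + \alpha)
```

The angular velocity vector therefore precesses about the axis of symmetry at the constant rate $\Omega$.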

One student can look at the life and times of Lagrange and how analytical 
dynamics was developed as a consequence of applying mathematically rig- 
orous thinking to Newton’s laws. This student would also need to develop 
rigid body mechanics of the type that would lead up to Euler’s equations, 
that is two dimensional rolling bodies. 

A third student could look at applications such as the principle of the gy- 
roscope and the development of the gyrocompass. Some practical problems 
would then be solved. 

There is potential for this project to go on to feature Lagrangian and Hamil- 
tonian dynamics, especially if quantum mechanics features as a taught 
module where the Poisson bracket is important. In general it is better to 
link group projects to work done by all the students. This is beneficial and 
a project that introduces the Hamiltonian and the concept of action can 
be very useful in the study of quantum mechanics. 

6. Boolean Algebra 

Boolean algebra has an interesting place in mathematics. The underlying 
concept in the form of the algebra of binary numbers is met at school. 
However, sometimes there is no Boolean algebra as such in any of the three 
years of a mathematics degree course! Most students meet abstract algebra 
(groups, rings and fields) and Boolean algebra can be introduced as an 
example. However, it has such a distinctive feel that this group project can 
virtually be started from scratch. 

One student can look at the definition of union and intersection, the basic 
laws of associativity, distributivity and commutativity, complementarity
and the De Morgan laws.

One student can look at the application to switching circuits and the wealth 
of applications in electronics. This nicely leads on to NAND and NOR gates 
and the use of Boolean algebra to simplify circuits. How to design a circuit 
to control the central lights in a room with three doors is a favourite. 
Designing lift buttons so that the nearest one out of three or four comes is 
another. 
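
Ideas like these are easily checked by exhaustive truth tables, as in the sketch below. It is an illustration under assumptions: the function names are hypothetical, and the three-door lighting problem is interpreted as requiring that flipping any one of three switches changes the state of the light, which is the parity (exclusive or) of the three switch positions. Every gate is built from NAND alone to illustrate that NAND is sufficient.

```python
from itertools import product

def nand(a, b):
    return not (a and b)

# Other gates built from NAND only
def not_(a):
    return nand(a, a)

def and_(a, b):
    return not_(nand(a, b))

def or_(a, b):
    return nand(not_(a), not_(b))

def xor_(a, b):
    n = nand(a, b)
    return nand(nand(a, n), nand(b, n))

def light_on(s1, s2, s3):
    """Three-switch lighting circuit: the light is on for odd parity."""
    return xor_(xor_(s1, s2), s3)

# De Morgan's laws checked over every combination of truth values
de_morgan_holds = all(
    (not (a and b)) == ((not a) or (not b))
    and (not (a or b)) == ((not a) and (not b))
    for a, b in product([False, True], repeat=2)
)

# Table of switch positions against the state of the light
truth_table = {s: light_on(*s) for s in product([False, True], repeat=3)}
```

A student could go on to count the NAND gates used and ask whether the circuit can be simplified, which is where the algebra earns its keep.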

The applications of Boolean algebra to propositional logic and the design 
of expert systems is another fruitful area for a student. This could also 
include an analysis of the various methods of proof which by this stage 
should be familiar to the student. 

There will be a certain amount of interplay between this group project and 
any option on control theory that might be running. Again it is up to the 
lecturer to keep an eye on potential overlaps. 

7. Archimedes and Apollonius 

This group project cannot run if the history of the mathematics of 
Archimedes and Apollonius forms part of a module. These two encompass 
the development of mathematics (principally geometry) in ancient Greece, 
and the subject area is so vast that only a few hints of possible sub-projects 
are given. 

Here are a few investigations linked with Archimedes: 

the Principle of Archimedes (hydrostatics and the famous “Eureka!” mo- 
ment); 

the method of indivisibles and his calculation of areas, predating the cal- 
culus by 2000 years; 

the Archimedean solids; 

his work on cycloids. 

For Apollonius we have: 

his work on ratios; 

the conic sections; 

various astronomical calculations including cycles and epicycles, and those 
involving comparing the dodecahedron and the icosahedron. 

There are many other possibilities. In general the History of Mathematics 
can provide a good vehicle for both group and individual projects, but 
there is a health warning. Make sure that each student engages with the 
mathematics and not just with the person and the culture of the time. As 
important as these latter are, this is a mathematics group project. 




5 

Case Studies 



5.1 Introduction 

If the idea of plunging straight into student centred projects seems too much 
of a leap into the dark, an alternative is to provide case studies. This chapter 
shows how to do this largely through examples. The traditional lecture course 
(or “module” in these days of unitised courses) consists of a set of aims with 
the expected learning outcomes together with a syllabus. The student attends 
lectures, and there are usually (but increasingly less frequently) problem classes 
where the student can receive help with prescribed problems set by the lecturer. 
The whole is wrapped up by an end of module examination. For most modules 
there is in addition a coursework element whereby each student independently 
is assessed on a longer piece of work which has to be completed over about 
three weeks or a month, but this rarely exceeds 25% of the total assessment 
load. Commonly, one piece of coursework is set to be done two thirds of the way 
through the module. The exception to all this is numerically based mathematics 
which lends itself more naturally to coursework based assessment rather than 
the formal end of unit examination. Modules involving largely numerical work 
can be entirely coursework assessed, but 50% is more common. Of course, the 
statistics parts of a mathematics degree are peppered with potential case study 
material but these are outside the scope of the present text. 

The case study approach is in a sense a half-way house between a formal 
lecture course and a group project. The lecturer introduces a scenario, usually 
taking two or three lecture hours to do so. After this presentation, each student 
working in isolation or in groups is assessed on their understanding of the study. 
There is no extra research by the student, but the student is expected to solve 
some extra follow-up material not presented by the lecturer. 

Assessment is usually via coursework but there could be an examination 
too. There are some further words on this in the chapter summary at the end. 
Some suggestions for assessment tasks are given for each of the case studies. 
These take the form of alternative analytical forms, alternative assumptions, 
considering one feature and exploring further, etc. Here is the first of six ex- 
amples of case studies. The form this chapter takes therefore is a sequence of 
quite detailed descriptions of case study examples. These are designed mainly 
for the lecturer so that they can provide a vehicle for a class based case study, 
but they will also be useful for students undertaking the studies. It could be 
the opinion of some that the description given is a little overfull and lands the 
lecturer with the problem of not leaving enough for the student to do. This au- 
thor would rather err in this direction than not giving enough detail. Besides, 
all the examples are open ended and it is always possible for any lecturer worth 
his or her salt to extend the problems. Let us introduce the first example now. 



5.2 Ocean Surface Dynamics 

This case study is unashamedly based around the research interests of the 
author. It concerns the way the ocean responds when a wind blows over it. 
It requires students to be familiar with vector calculus, differential equations 
and, to some extent, fluid mechanics, although the amount of fluid mechanics 
required as a prerequisite can be minimal and catered for by the style of delivery. 
The most difficult aspects for students to follow are not the technical facility 
with any particular mathematics, but with what can be neglected and how to 
model what is retained. There are textbooks about how to model the movement 
of the ocean, most of them intended for postgraduate researchers which are too 
sophisticated for even final year students, so the case study has to be presented 
carefully. 

The first place to start this case study is with a description of the acceler- 
ations and forces that act on the ocean. The atmosphere and ocean are fluids 
attached to a rotating earth by gravity. There are three accelerations: one due 
to the accelerating fluid which is called the particle acceleration, the second 
due to the rotation of the earth called the Coriolis acceleration and a third due 
to any fluid particle not being isolated (it is surrounded by other particles in 
much the same way as a rigid body although it is not rigid of course): this is 
the advective acceleration. The first of these is the rate of change of velocity 
and is neither a difficult nor a new concept. The other two are trickier. The 
third acceleration is not important as an acceleration, but provides the source 
for the turbulence model to follow later. We return to it then. The second is 
important and the following mathematics describes how it is included in the 
equations of motion. First of all Figure 5.1 shows the earth and its rotation 
about the North-South axis. An arbitrary point on the earth's surface is 
labelled O and is the origin of local (x, y, z) co-ordinates: x points east, 
y points north and z points up. 

Figure 5.1 Motion relative to a rotating earth. 

In this co-ordinate system shown in Figure 5.1 the velocity of the fluid is 
$\mathbf{u}$, but of course the origin is moving with respect to the 
centre of the earth. (The earth's centre will be assumed fixed as its motion 
around the sun has such a large radius of curvature that it is straight to a very 
good approximation.) The motion of the point O is by virtue of the angular 
velocity of the earth. In the plane of the line of latitude that contains O, the x 
and y axes rotate with angular velocity $\Omega \sin\theta$ where $\theta$ is latitude. 
This is the vertical (z) component of $\boldsymbol{\Omega}$. The vector 
$\boldsymbol{\Omega}$ has direction south pole to north 
pole. Thus, when the derivative (rate of change) with respect to time of any 
vector is calculated, and this calculation is performed relative to the rotating 
axes, the changes of the unit vectors that point along Ox and Oy, call them 
$\mathbf{i}$ and $\mathbf{j}$, need to be taken into account. So, take any vector 
$\mathbf{A}(t)$ which is a 
function of time. Then the calculation of its rate of change proceeds as follows: 

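\[ \frac{d\mathbf{A}(t)}{dt} = \frac{d(A_1\mathbf{i} + A_2\mathbf{j})}{dt} = \frac{dA_1}{dt}\mathbf{i} + \frac{dA_2}{dt}\mathbf{j} + A_1\frac{d\mathbf{i}}{dt} + A_2\frac{d\mathbf{j}}{dt}, \]

where $A_1$ and $A_2$ are the components of $\mathbf{A}(t)$ in the x and y directions 
respectively. By examining how the axes change in an infinitesimally small time, it 
can easily be shown that 

\[ \frac{d\mathbf{i}}{dt} = \Omega\mathbf{j} \quad\text{and}\quad \frac{d\mathbf{j}}{dt} = -\Omega\mathbf{i} \]

and hence 

\[ \frac{d\mathbf{A}(t)}{dt} = \frac{dA_1}{dt}\mathbf{i} + \frac{dA_2}{dt}\mathbf{j} + \Omega A_1\mathbf{j} - \Omega A_2\mathbf{i}. \]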



The first two terms are the rate of change of $\mathbf{A}(t)$ with respect to time as if 
the axes were not rotating, so the extra two terms represent the effects of the 
rotation. These can be put succinctly in terms of vector quantities by using the 
cross product. Thus we can write: 

\[ \left[\frac{d\mathbf{A}(t)}{dt}\right]_{\text{fixed}} = \left[\frac{d\mathbf{A}(t)}{dt}\right]_{\text{rotating}} + \boldsymbol{\Omega}\times\mathbf{A}(t). \]



The derivation of this has been two dimensional and may not look very gen- 
eral. However, it is mathematically rigorous as the above vector equation is 
co-ordinate independent, and a mathematical result that is co-ordinate inde- 
pendent but derived using specific co-ordinates (in our case two dimensional 
Cartesian co-ordinates) remains true in all orthogonal curvilinear co-ordinate 
systems. This is a useful theorem and illustrates the usefulness of pure mathe- 
matics (as if that were necessary!). 
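The rotating-axes formula can be checked numerically before it is used. The following is a minimal sketch under stated assumptions: the rotation rate `OMEGA`, the component functions and all function names are invented for the illustration, and the fixed-frame derivative is estimated by a central difference rather than taken from the text.

```python
import math

OMEGA = 0.7  # arbitrary rotation rate about the vertical, chosen for the test


def basis(t):
    """Unit vectors i(t), j(t) of the rotating axes, written in fixed axes."""
    i = (math.cos(OMEGA * t), math.sin(OMEGA * t))
    j = (-math.sin(OMEGA * t), math.cos(OMEGA * t))
    return i, j


def components(t):
    """Arbitrary components A1(t), A2(t) measured in the rotating axes."""
    return math.sin(t), t * t


def component_rates(t):
    """Time derivatives of A1 and A2 (the rate seen by a rotating observer)."""
    return math.cos(t), 2.0 * t


def fixed_vector(t):
    """A(t) = A1 i + A2 j resolved along the fixed axes."""
    (ix, iy), (jx, jy) = basis(t)
    a1, a2 = components(t)
    return a1 * ix + a2 * jx, a1 * iy + a2 * jy


def fixed_rate_numeric(t, h=1e-6):
    """Central-difference estimate of dA/dt in the fixed frame."""
    xp, yp = fixed_vector(t + h)
    xm, ym = fixed_vector(t - h)
    return (xp - xm) / (2.0 * h), (yp - ym) / (2.0 * h)


def fixed_rate_formula(t):
    """Rotating-frame rate plus Omega x A = Omega (A1 j - A2 i)."""
    (ix, iy), (jx, jy) = basis(t)
    a1, a2 = components(t)
    d1, d2 = component_rates(t)
    ci = d1 - OMEGA * a2  # coefficient of i
    cj = d2 + OMEGA * a1  # coefficient of j
    return ci * ix + cj * jx, ci * iy + cj * jy


if __name__ == "__main__":
    print(fixed_rate_numeric(1.3))
    print(fixed_rate_formula(1.3))
```

The two printed pairs should agree to within the accuracy of the finite difference, whatever components are chosen.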

The Coriolis acceleration is now easily derived by a double application of 
the formula. If r denotes the position vector relative to an inertial frame of 
reference (the centre of the earth for example) then we have 



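\[ \frac{d^2\mathbf{r}}{dt^2} = \frac{d}{dt}\left(\frac{d\mathbf{r}}{dt}\right) \]

where all derivatives are relative to fixed axes. The left-hand side is the true 
acceleration. In terms of actually measurable quantities which are of course 
measured relative to a rotating frame of reference we thus have: 

\[ \left[\frac{d^2\mathbf{r}(t)}{dt^2}\right]_{\text{fixed}} = \left(\left[\frac{d}{dt}\right]_{\text{rotating}} + \boldsymbol{\Omega}\times\right)\left(\left[\frac{d\mathbf{r}(t)}{dt}\right]_{\text{rotating}} + \boldsymbol{\Omega}\times\mathbf{r}(t)\right) \]

\[ = \left[\frac{d^2\mathbf{r}(t)}{dt^2}\right]_{\text{rotating}} + 2\boldsymbol{\Omega}\times\left[\frac{d\mathbf{r}(t)}{dt}\right]_{\text{rotating}} + \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}(t)). \]

The Coriolis acceleration is the term 

\[ 2\boldsymbol{\Omega}\times\left[\frac{d\mathbf{r}(t)}{dt}\right]_{\text{rotating}}. \]

The term 

\[ \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}(t)) \]

is the centripetal acceleration which is directed towards the axis of rotation of 
the earth but is so small compared to gravity that it is usually ignored. We 
have also assumed that the angular velocity of the earth $\boldsymbol{\Omega}$ does not vary with 
time. (Otherwise there would be yet another term to consider.) 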

With respect to the chosen co-ordinate system (see Figure 5.1) the Coriolis 
acceleration is 



\[ 2\boldsymbol{\Omega}\times\mathbf{u} = 2(0, \Omega\cos\theta, \Omega\sin\theta)\times(u, v, w) \]

where $(u, v, w) = \mathbf{u}$ is the fluid velocity. The double use of the symbol u is less 
confusing than alternatives. u is the easterly current, v the northerly current 
and w the upward current. This latter is of course very small and is neglected 
except for very specialist modelling. w is always neglected when compared 
with u or v, and it is this approximation that gives the Coriolis acceleration the 
components $(-fv, fu, 0)$ where $f = 2\Omega\sin\theta$. f is called the Coriolis parameter; 
it is twice the vertical component of the earth's angular velocity, and it is only 
this component that plays an important part in the dynamics of the ocean and 
atmosphere. In a mathematics course it is usual to give the details. Expanding 
the vector cross product we get 

\[ 2\boldsymbol{\Omega}\times\mathbf{u} = 2\begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ 0 & \Omega\cos\theta & \Omega\sin\theta \\ u & v & w \end{vmatrix} = 2(\Omega w\cos\theta - \Omega v\sin\theta,\ \Omega u\sin\theta,\ -\Omega u\cos\theta). \]

As $w \ll v$ the $\Omega w\cos\theta$ term in the first component may be safely neglected. 
Moreover, vertically the balance is between gravity and the vertical pressure 
gradient. As these are both several orders of magnitude larger than $\Omega u\cos\theta$ this latter 
term too can be cast aside. This vertical balance is called hydrostatic pressure 
and expresses that the pressure at a point in the sea is due to the weight of 
water above it rather than any dynamic effects, be they Coriolis or anything 
else. The result is that only the vertical component of the earth's rotation enters 
the dynamics and as we stated above the Coriolis acceleration can be expressed 
as $(-fv, fu, 0)$ to a good degree of accuracy, where $f = 2\Omega\sin\theta$. 
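The size of the neglected terms can be seen by evaluating the full cross product alongside the approximation. This is a rough sketch: the function names are invented, the current speeds and latitude in the usage lines are illustrative guesses, and the only physical constant used is the earth's rotation rate of about 7.292e-5 radians per second.

```python
import math

OMEGA = 7.292e-5  # earth's angular speed in rad/s


def cross(a, b):
    """Vector cross product of two 3-tuples."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def coriolis_parameter(lat_deg):
    """f = 2 Omega sin(latitude)."""
    return 2.0 * OMEGA * math.sin(math.radians(lat_deg))


def coriolis_full(u, v, w, lat_deg):
    """All three components of 2 Omega x u in local (east, north, up) axes."""
    theta = math.radians(lat_deg)
    two_omega = (0.0, 2.0 * OMEGA * math.cos(theta), 2.0 * OMEGA * math.sin(theta))
    return cross(two_omega, (u, v, w))


def coriolis_approx(u, v, lat_deg):
    """The approximation (-f v, f u, 0) used in the text."""
    f = coriolis_parameter(lat_deg)
    return (-f * v, f * u, 0.0)


if __name__ == "__main__":
    # Illustrative values only: 0.3 and 0.2 m/s horizontally, 1e-4 m/s upward.
    print(coriolis_full(0.3, 0.2, 1.0e-4, 50.0))
    print(coriolis_approx(0.3, 0.2, 50.0))
```

With no vertical velocity the horizontal components agree exactly; the remaining vertical component is the one dismissed in favour of the hydrostatic balance.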

In order to see what forces act on the ocean we consider an infinitesimally 
small cube of sea. Gravity acts to pull this downwards and there are pressure 
forces acting that give forces across opposite faces of the cube proportional to 
the gradient of the pressure. Both of these are standard forces which students 
would have met in a course on fluid mechanics. If this is new to the students, 
then elementary vector calculus of the type 



\[ -\int_V \rho g\mathbf{k}\,dV + \int_S (-p\mathbf{n})\,dS = -\int_V \rho g\mathbf{k}\,dV - \int_V \nabla p\,dV = \int_V (-\rho g\mathbf{k} - \nabla p)\,dV \]

needs to take place. ($\mathbf{k}$ is the unit vector in the vertical direction, and $\mathbf{n}$ is a unit 
outwardly drawn normal to the infinitesimal cube.) The final force to consider 
is due to friction. The representation of stress in a fluid will be new to most 
students, so some detail is required. However we first return to the advective 
acceleration. For those familiar with fluids, this manifests itself in the difference 
between standard time rate of change and rate of change following the fluid. The 
first is termed the Eulerian view, the second the Lagrangian view. Mathematics 
students are often happy with the following explanation in terms of calculus. 

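Consider a function of four variables, x, y, z, t say F(x, y, z, t). 
Straightforwardly it has a time derivative 

\[ \frac{\partial F}{\partial t}. \]

This expresses the way F changes explicitly with time t. However, suppose that 
x, y and z all depend on t too. The total derivative of F with respect to t would 
then be expressed via the chain rule as 

\[ \frac{dF}{dt} = \frac{\partial F}{\partial x}\frac{dx}{dt} + \frac{\partial F}{\partial y}\frac{dy}{dt} + \frac{\partial F}{\partial z}\frac{dz}{dt} + \frac{\partial F}{\partial t} \]

or 

\[ \frac{dF}{dt} = (\mathbf{u}.\nabla)F + \frac{\partial F}{\partial t}. \]

Here we have used that 

\[ \mathbf{u} = \left(\frac{dx}{dt}, \frac{dy}{dt}, \frac{dz}{dt}\right). \]

Putting the arbitrary F equal to the velocity $\mathbf{u}$ now gives the result 

\[ \frac{d\mathbf{u}}{dt} = \frac{\partial\mathbf{u}}{\partial t} + (\mathbf{u}.\nabla)\mathbf{u}. \]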



The first term on the right hand side is the Eulerian rate of change often 
expressed as the rate of change observed from a fixed point. The left hand side 
is the total or Lagrangian rate of change often expressed as the rate of change 
following the fluid. The second term on the right is the difference, which in 
some contexts (water waves) and after some correct time averaging is called 
the Stokes’ drift. It is now useful to write down the vector equation of motion 
for the ocean. It is the fluid equivalent of Newton’s second law, called the 
Navier-Stokes’ equation, for a rotating fluid. 

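\[ \frac{\partial\mathbf{u}}{\partial t} + (\mathbf{u}.\nabla)\mathbf{u} + 2\boldsymbol{\Omega}\times\mathbf{u} = -\frac{1}{\rho}\nabla p + \mathbf{g} + \text{friction}. \]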

Let us now consider the form that friction takes. We shall then be ready to 
look at this equation in component form. 

The “friction” being discussed here is not molecular friction of the type met 
in mechanics when objects slide or do not slide down planes, but turbulence. As 
there are learned books devoted to modelling fluid turbulence (e.g. the classical 
text by A. A. Townsend (1954)) only a brief introduction is given here. The 
basic idea in applying turbulence ideas to modelling the sea is that any current 
in the sea can be thought of as a mean flow that may be time dependent 
but is steady on a short time scale of a few seconds, plus a small random 
deviation. Mathematically $\mathbf{u} = \bar{\mathbf{u}} + \mathbf{u}'$ where $\overline{\mathbf{u}'} = 0$ and the overbar denotes 
the mean taken over a few seconds. This is now inserted in all the governing 
equations and the overall time average (again over a few seconds) taken. All 
is as expected, except for the non-linear advective acceleration which produces 
the extra non-zero term 

\[ \overline{(\mathbf{u}'.\nabla)\mathbf{u}'} \]

on the left hand side of the Navier-Stokes’ equations. The small random devi- 
ation u' is of course unknown, but fortunately the non linear non-zero term is 
the only place in the equations where it occurs. It is called the Reynolds stress 
term and although it may look harmless, it actually contains 9 terms consisting 
of all pairs of combinations of the components of $\mathbf{u}' = (u', v', w')$ with itself. 
One of the simplest assumptions to make is to relate appropriate parts of this 
Reynolds stress to gradients in the mean flow. The Reynolds stress is in fact a 
tensor (strictly a second order tensor) which is made up from the nine 
components and can be represented by a 3 x 3 matrix. It can be succinctly written 
$\tau_{ij}$ where the suffices i and j independently run from 1 to 3. In detail 

\[ \begin{pmatrix} \tau_{11} & \tau_{12} & \tau_{13} \\ \tau_{21} & \tau_{22} & \tau_{23} \\ \tau_{31} & \tau_{32} & \tau_{33} \end{pmatrix} = \begin{pmatrix} -\rho\overline{u'^2} & -\rho\overline{u'v'} & -\rho\overline{u'w'} \\ -\rho\overline{v'u'} & -\rho\overline{v'^2} & -\rho\overline{v'w'} \\ -\rho\overline{w'u'} & -\rho\overline{w'v'} & -\rho\overline{w'^2} \end{pmatrix} \]



where each $\tau_{ij}$ in fact measures the covariance (statistical measure of 
agreement) between two of the fluctuating components $\mathbf{u}' = (u', v', w')$. If i = j it 
is the autocovariance of u', v' or w' that is being measured. Both matrices are 
of course symmetric ($\tau_{ij} = \tau_{ji}$). This follows straight away from the right hand 
matrix. In fact if i = j then the stress is called a normal stress. Since normal 
stresses act in a similar fashion to pressure, they can be safely overlooked (or 
more accurately absorbed by the pressure). When $i \neq j$ the stress is a shear 
stress and it is these that need to be modelled. It is of course the gradients 
of the stresses that contribute to the conservation of momentum in much the 
same way as does the gradient of pressure and not the pressure itself. 
Turbulence and friction have in common the transfer of momentum in a direction at 
right angles to the flow. The magnitude of the rate at which this transfer takes 
place in a turbulent ocean current is what is attempted to be modelled by eddy 
viscosity. The big assumptions are one, that such a relationship is reasonable 
and two, that the relationship is linear. Neither of these can be justified 
except perhaps a posteriori. To home in on a particular component, let us choose 
$\tau_{13}$ ($= \tau_{31}$). In this case, the eddy viscosity assumption, a turbulence equivalent 
to assuming a Newtonian viscous fluid, leads to 

\[ \tau_{31} = -\rho\overline{w'u'} = \rho\nu_V\frac{\partial\bar{u}}{\partial z} \]

where $\nu_V$ is the eddy viscosity or Austausch coefficient. It is the turbulent 
equivalent to kinematic (not dynamic) viscosity. Not only is it much bigger than 
normal viscosity, usually by many orders of magnitude, but it is of course not a fixed 
property of the fluid. The gradient of the stress and not the stress itself gives 
the net force. No apology is necessary for reiterating that this is the same as 
gradients of pressure, not pressure itself, causing net force. Hence if the overall 
force balance is to include shear stress effects then terms such as 

\[ \frac{\partial}{\partial z}\left(\nu_V\frac{\partial\bar{u}}{\partial z}\right) \]

need to be included. If the eddy viscosity $\nu_V$ is a constant, then this can be 
written in terms of a second derivative: 

\[ \nu_V\frac{\partial^2\bar{u}}{\partial z^2}. \]

The stress $\tau_{31}$ is a stress that represents the shear due to a current travelling 
in an easterly direction in the presence of either the sea bed or the sea surface. 
Other shear stresses, for example $\tau_{21}$ which would represent the shear due to an 
easterly current near a north-south coast, could also be put in terms of gradients 
in mean flow: 

\[ \tau_{21} = -\rho\overline{v'u'} = \rho\nu_H\frac{\partial\bar{u}}{\partial y} \]

but this time the eddy viscosity $\nu_H$ is representative of the horizontal transfer 
of momentum in the current. The horizontal eddies that effect this transfer 
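are correspondingly larger than the vertical counterparts, hence $\nu_H \gg \nu_V$ 
(by several orders of magnitude in fact). Let us examine the equation of motion 
(conservation of momentum) in the x-direction. If we include advection, vertical 
and horizontal momentum transfer (eddy viscosity) then dimensional analysis 
can be performed on this equation. As all reference to primed (fluctuating) 
quantities has ceased, the overbar can be dropped. The general x-wise equation 
of motion then is rather daunting, until one remembers that it is not actually 
going to be solved. Here it is: 

\[ \frac{\partial u}{\partial t} + u\frac{\partial u}{\partial x} + v\frac{\partial u}{\partial y} + w\frac{\partial u}{\partial z} - fv = -\frac{1}{\rho}\frac{\partial p}{\partial x} + \nu_V\frac{\partial^2 u}{\partial z^2} + \nu_H\left(\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2}\right). \]

The y-wise equation is, similarly 

\[ \frac{\partial v}{\partial t} + u\frac{\partial v}{\partial x} + v\frac{\partial v}{\partial y} + w\frac{\partial v}{\partial z} + fu = -\frac{1}{\rho}\frac{\partial p}{\partial y} + \nu_V\frac{\partial^2 v}{\partial z^2} + \nu_H\left(\frac{\partial^2 v}{\partial x^2} + \frac{\partial^2 v}{\partial y^2}\right). \]

All this is by way of introduction! How much detail is given is of course in 
the hands of the module or unit leader. Neglecting the terms containing $\nu_H$ is 
equivalent to ignoring horizontal turbulence. If the non-linear terms are ignored 
too we get 

\[ \frac{\partial u}{\partial t} - fv = -\frac{1}{\rho}\frac{\partial p}{\partial x} + \nu_V\frac{\partial^2 u}{\partial z^2} \]

\[ \frac{\partial v}{\partial t} + fu = -\frac{1}{\rho}\frac{\partial p}{\partial y} + \nu_V\frac{\partial^2 v}{\partial z^2}. \]

These are the unsteady Ekman equations. If we write 

\[ (u, v) = (u_G + u_E, v_G + v_E) \]

where 

\[ -fv_G = -\frac{1}{\rho}\frac{\partial p}{\partial x} \quad\text{and}\quad fu_G = -\frac{1}{\rho}\frac{\partial p}{\partial y} \]

the equations reduce to 

\[ \frac{\partial u_E}{\partial t} - fv_E = \nu_V\frac{\partial^2 u_E}{\partial z^2} \]

and 

\[ \frac{\partial v_E}{\partial t} + fu_E = \nu_V\frac{\partial^2 v_E}{\partial z^2}. \]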



We have separated out the geostrophic balance part (suffix G) from the fric- 
tional or Ekman part (suffix E). Geostrophic balance is a balance between 
pressure gradient and Coriolis acceleration which permeates much of the 
atmosphere and ocean. It gives rise to the characteristic “Lows” and “Highs” of 
pressure seen on weather maps in temperate regions. The important assumption 
which allows the above partitioning is that $(u_G, v_G)$ remains independent of z 
and t. The final assumption is that the Ekman flow is steady, which eliminates 
the time derivative terms and leaves us with 



\[ -fv_E = \nu_V\frac{d^2 u_E}{dz^2} \quad\text{and}\quad fu_E = \nu_V\frac{d^2 v_E}{dz^2}. \]

These can be solved elegantly using complex numbers as follows. We add the 
first equation to i times the second, where $i = \sqrt{-1}$ of course, leading to the 
single equation 

\[ ifW = \nu_V\frac{d^2 W}{dz^2} \]

in the complex quantity $W = u_E + iv_E$. This is a second order differential 
equation with constant coefficients which can be solved given two boundary 
conditions on W. One is straightforward. As the wind is the driving force, it is 
reasonable to suppose that as we get deeper and deeper into the sea, W must 
get smaller and smaller in magnitude. Thus mathematically 



\[ W \to 0 \quad\text{as}\quad z \to -\infty, \]

remembering that z points upwards. The second boundary condition is more 
subtle. At the sea surface (z = 0) the wind blows. If we assume that the stress 
imparted by the wind is precisely the same as the shear stress experienced by 
the sea at the surface, then our previous assumptions concerning eddy viscosity 
lead to 

\[ \boldsymbol{\tau} = (\tau^x, \tau^y) = \rho\nu_V\left(\frac{\partial u_E}{\partial z}, \frac{\partial v_E}{\partial z}\right), \quad\text{that is}\quad \tau = \rho\nu_V\frac{dW}{dz}, \]

at z = 0, where $\boldsymbol{\tau} = (\tau^x, \tau^y)$, written in complex form as $\tau = \tau^x + i\tau^y$, is the 
(horizontal) wind stress vector. Solving the Ekman equations is now 
straightforwardly done and gives the solution 

\[ W = \frac{\tau}{\rho\sqrt{if\nu_V}}\exp\left(z\sqrt{\frac{if}{\nu_V}}\right), \]

where the square root with positive real part is taken so that W decays with depth. 
Having done the hard work and worked through all the assumptions carefully, 
we can reap the rewards and explore with confidence the properties of this 
solution. Many of the properties are actually verified by oceanographic observation. 
At the surface of the sea, z = 0 so 

\[ W = \frac{\tau}{\rho\sqrt{if\nu_V}} = \frac{(1 - i)\,\tau}{\rho\sqrt{2f\nu_V}} \quad (f > 0) \]

and using the properties of complex numbers, it is easy to see that this implies 
that the surface current is at a 45° angle to the right of the wind in the northern 
hemisphere. (South of the equator, / is negative which implies the current flows 
45° to the left of the wind.) This verifies observations made by Nansen, the 
Norwegian explorer who stuck his boat the Fram in the ice north of Svalbard 
(Spitzbergen) in 1896 in a bid to be the first to reach the North Pole. He was 
there three years and was not successful. In fact it was the request from Nansen 
to Ekman to explain this angle that led Ekman to his theory that has been 
repeated here. Ekman published his paper in 1905, the same year as Einstein’s 
seminal publication outlining the special theory of relativity for the first time. 
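The 45° result is easy to confirm with complex arithmetic. The sketch below assumes the surface condition in the form ρ ν_V dW/dz = τ at z = 0 with z upwards, and the decaying solution that follows from it; the function names and the numerical values (sea water density, eddy viscosity, Coriolis parameter, wind stress) are illustrative choices made here and are not taken from the text.

```python
import cmath
import math


def ekman_current(z, tau, f, nu_v, rho=1025.0):
    """Steady Ekman current W = u_E + i v_E at height z <= 0.

    tau is the complex wind stress tau_x + i tau_y. The principal square
    root has positive real part for either sign of f, so W decays as z
    tends to minus infinity.
    """
    k = cmath.sqrt(1j * f / nu_v)
    return tau / (rho * nu_v * k) * cmath.exp(k * z)


def surface_angle_deg(tau, f, nu_v):
    """Angle of the surface current relative to the wind (negative = to the right)."""
    w0 = ekman_current(0.0, tau, f, nu_v)
    return math.degrees(cmath.phase(w0 / tau))


if __name__ == "__main__":
    wind = 0.1 + 0.0j  # illustrative eastward wind stress in N/m^2
    print("northern hemisphere:", surface_angle_deg(wind, 1.0e-4, 1.0e-2))
    print("southern hemisphere:", surface_angle_deg(wind, -1.0e-4, 1.0e-2))
```

Dividing the surface current by the wind stress leaves only the rotation between them, so the angle does not depend on the particular values chosen for the density or the eddy viscosity.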

Here are a few auxiliary questions and suggestions for extensions. 

Integrate the current through z and show that the net transport is at right 
angles to the wind direction. Show also that this net transport is independent 
of the eddy viscosity. (A good student should be able to deduce that the eddy 
viscosity assumption is unnecessary for this particular result.) 
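Before attempting the integration analytically, a student might check the claim numerically. This sketch repeats the assumed steady solution (surface condition ρ ν_V dW/dz = τ, z upwards) and integrates it with the trapezium rule over twenty e-folding depths; the names, the cut-off depth and the parameter values are all illustrative assumptions.

```python
import cmath
import math


def ekman_current(z, tau, f, nu_v, rho=1025.0):
    """Steady Ekman current W = u_E + i v_E at height z <= 0 (assumed form)."""
    k = cmath.sqrt(1j * f / nu_v)
    return tau / (rho * nu_v * k) * cmath.exp(k * z)


def ekman_transport(tau, f, nu_v, rho=1025.0, n=20000):
    """Trapezium-rule integral of W from a large depth up to the surface."""
    depth = 20.0 * math.sqrt(2.0 * nu_v / abs(f))  # twenty e-folding depths
    h = depth / n
    total = 0.5 * (ekman_current(-depth, tau, f, nu_v, rho)
                   + ekman_current(0.0, tau, f, nu_v, rho))
    for m in range(1, n):
        total += ekman_current(-depth + m * h, tau, f, nu_v, rho)
    return total * h


if __name__ == "__main__":
    wind = 0.1 + 0.0j  # illustrative eastward wind stress in N/m^2
    for nu in (1.0e-2, 5.0e-2):
        t = ekman_transport(wind, 1.0e-4, nu)
        print(nu, t, math.degrees(cmath.phase(t / wind)))
```

Both eddy viscosities should give the same transport, directed 90° to the right of the wind for positive f, which is what the exact integral τ/(iρf) predicts.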

A slightly longer exercise would be to solve the time dependent Ekman 
equations: 

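\[ \frac{\partial u_E}{\partial t} - fv_E = \nu_V\frac{\partial^2 u_E}{\partial z^2} \]

\[ \text{and}\quad \frac{\partial v_E}{\partial t} + fu_E = \nu_V\frac{\partial^2 v_E}{\partial z^2}. \]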

This is most easily accomplished using the complex method as earlier, then 
using Laplace transforms. This leads to an ordinary differential equation which 
can be solved in terms of z and s (the Laplace transform variable). The solution 
is given in Laplace transform space as 



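\[ \overline{W} = \frac{\tau}{\rho s\sqrt{\nu_V(s + if)}}\exp\left(z\sqrt{\frac{s + if}{\nu_V}}\right) \]

where the overbar denotes the Laplace transform, and the ocean is taken to be at rest 
until a steady wind stress $\tau$ is switched on at t = 0. It is impossible to invert this 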
solution in closed form in terms of elementary functions. However there are 
theorems such as the final value theorem and the initial value theorem (see P. 
P. G. Dyke’s excellent Springer-Verlag Undergraduate text: “An Introduction 
to Laplace Transforms and Fourier Series”, published in 2001) which can be 
used to deduce the spin-up time for the Ekman layer, as the steady solution is 
called. 

Another avenue to explore is the solutions for the steady Ekman equations 
when the eddy viscosity depends on z. A linear dependence leads to Bessel’s 
differential equation; a quadratic dependence if designed carefully enough can 
lead to Euler’s differential equation with a solution in terms of powers of z. It 
is interesting to see how this modification changes the angle the surface current 
makes with the wind. If the students have been given the above material for self 
study, there are other more adventurous avenues to explore such as the effects of 
the earth’s curvature (the $\beta$ effect), unsteady effects such as Kelvin or Rossby 
waves or even a simple model of El Niño. Enough! The author could spend 
many more pages exploring extensions, but instead let us explore a different 
case study. 



5.3 Non-linear Oscillations 

Non-linear oscillations is a subject that has received a considerable boost re- 
cently through public interest in chaotic systems and several executive toys 
have been based on this. It is principally the enhanced power of the computer 
which has enabled recent advances in the study of dynamic systems, of which 
non-linear oscillations form a subset. However, the material for this case study 
originated from work done in the 1970s on designing stable semi-submersibles, 
ships and moored vessels and so general dynamical systems are not introduced. 
However, some students may be familiar with phase planes and the properties 
of solutions to non-linear first order differential equations which form part of 
the study of dynamic systems. These topics can slot conveniently into a more 
general course on differential equations. Indeed it is quite common for students 
to meet the idea of direction fields for non-linear first order differential equa- 
tions as early as in the first year of a mathematics degree. These topics are 
also related to the study of characteristics which is introduced briefly in the 
next section. Characteristics can also be part of an optional course on many 
undergraduate degrees. 

The mathematics of this case study is firmly set against classical solution 
techniques to ordinary differential equations. Therefore knowledge of power se- 
ries solutions, singularities, stability and the use of the Wronskian all feature 
and should be familiar to the students. The vehicle for the study is Duffing’s 
equation with a sinusoidal right hand side which represents non-linear 
oscillations subject to periodic forcing. Here it is: 

\[ \ddot{x} + k\dot{x} + \alpha x + \beta x^3 = \Gamma\cos(\omega t). \]

In the last case study, the governing equations were painstakingly derived from 
physical laws. The derivation of this from Newton’s second law of motion is 
easier and is not done here. Whether or not it is done in class is left to the 
lecturer’s judgement. A Fourier series solution to this is sought, the explanation 
being that oscillations are expected, but their frequency is not obvious. So set 

\[ x(t) = a_0 + a_1\cos(\lambda t) + a_2\cos(2\lambda t) + a_3\cos(3\lambda t) + \ldots \]

and substitute into the forced Duffing equation. The $x^3$ term is still periodic 
but generates all kinds of cross-terms. If like $\cos(\lambda nt)$ terms are equated on 
both sides, theoretically the Fourier series solution can be found. The full story 
can be found in specialist texts such as D.W. Jordan and P. Smith Non-linear 
Ordinary Differential Equations the third edition of which was published in 
1999 by Oxford University Press. As a trial solution we could try 

\[ x(t) = a\cos(\omega t) + b\sin(\omega t). \]

This represents the first two terms of a Fourier series. The equation does not 
allow a constant term, but this severely truncated Fourier series does allow for 
phase differences. Substitution into the undamped equation ($k = 0$, with $\alpha = 1$) 
and some manipulation of the cube term leads to 

\[ b\{(\omega^2 - 1) - \tfrac{3}{4}\beta(a^2 + b^2)\} = 0 \]

\[ a\{(\omega^2 - 1) - \tfrac{3}{4}\beta(a^2 + b^2)\} = -\Gamma \]

which implies b = 0 so the phase difference disappears! Moreover a has to be a 
root of the cubic 

\[ \tfrac{3}{4}\beta a^3 - (\omega^2 - 1)a - \Gamma = 0. \]

The terms involving trigonometric functions of $3\omega t$ are neglected as they are 
assumed small compared to the terms having the fundamental frequency $\omega$. 
These solutions are investigated graphically. For example there are three 
solutions provided $\beta < 0$, $\omega^2 < 1$ and $\Gamma$ is small. If instead $\omega^2 > 1$ (still with 
$\beta < 0$) there is only ever one solution. The next slightly more general solution 
to try is not to include further terms of the Fourier series, but to allow the 
coefficients a and b to become a(t) and b(t), i.e. slowly varying functions of time. 
Once more ignoring the $3\omega t$ harmonics this time leads to the coupled first order system: 

\[ \dot{a} = -\frac{b}{2\omega}\{(\omega^2 - 1) - \tfrac{3}{4}\beta(a^2 + b^2)\} \]

\[ \dot{b} = \frac{a}{2\omega}\{(\omega^2 - 1) - \tfrac{3}{4}\beta(a^2 + b^2)\} + \frac{\Gamma}{2\omega} \]

which in the language of systems represents an autonomous system. Solutions 
are drawn in the (a, b) plane, and equilibrium points $\dot{a} = 0$ and $\dot{b} = 0$ represent 
the constant a and b solutions already obtained. These equilibrium points can 
be analysed in terms of their stability. As mentioned earlier, there can be three 
such points or there could be just one depending on the particular values of 
the parameters. This is a rich source of mathematical problems and the case 
study could, at this juncture, explore many possibilities. Chapter 7 of Jordan 
and Smith (1999) does just this. 
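The count of equilibrium points can be explored numerically as well as graphically. The sketch below assumes the undamped equation with α = 1, so that the amplitude satisfies (3/4)βa³ − (ω² − 1)a − Γ = 0; it locates the real roots by scanning for sign changes and bisecting, and then confirms that each root with b = 0 is an equilibrium of the slowly varying system. The function names, the search interval and the parameter values are illustrative assumptions.

```python
def amplitude_cubic(a, beta, omega, gamma):
    """(3/4) beta a^3 - (omega^2 - 1) a - Gamma."""
    return 0.75 * beta * a ** 3 - (omega ** 2 - 1.0) * a - gamma


def bisect_root(func, lo, hi, iterations=80):
    """Plain bisection on an interval over which func changes sign."""
    f_lo = func(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


def amplitude_roots(beta, omega, gamma, span=10.0, n=20000):
    """Real roots of the amplitude cubic lying in [-span, span]."""
    def g(a):
        return amplitude_cubic(a, beta, omega, gamma)

    roots = []
    step = 2.0 * span / n
    left = -span
    for m in range(n):
        right = -span + (m + 1) * step
        if g(left) == 0.0:
            roots.append(left)
        elif g(left) * g(right) < 0.0:
            roots.append(bisect_root(g, left, right))
        left = right
    return roots


def slow_flow(a, b, beta, omega, gamma):
    """Right-hand sides of the first order system for a(t) and b(t)."""
    bracket = (omega ** 2 - 1.0) - 0.75 * beta * (a * a + b * b)
    a_dot = -b * bracket / (2.0 * omega)
    b_dot = (a * bracket + gamma) / (2.0 * omega)
    return a_dot, b_dot


if __name__ == "__main__":
    print(amplitude_roots(-0.2, 0.8, 0.05))  # beta < 0, omega^2 < 1, small Gamma
    print(amplitude_roots(-0.2, 1.2, 0.05))  # beta < 0, omega^2 > 1
```

Varying Γ in the first call until two of the roots merge gives a feel for how small the forcing has to be for three solutions to exist.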
Another route is to explore oscillations not at the frequency of the forcing 
but at some fraction of this frequency. These are called subharmonic oscillations 
and they can have important consequences for design engineers. Of course a 
simple linear SHM equation such as 

\[ \ddot{x} + \frac{1}{n^2}x = \Gamma\cos t \]

will have the subharmonic oscillatory solution 

\[ a\cos\frac{1}{n}t + b\sin\frac{1}{n}t - \frac{\Gamma}{1 - n^{-2}}\cos t, \]

but subharmonics also occur less obviously. Looking for perturbation solutions 
of Duffing’s Equation 

\[ \ddot{x} + k\dot{x} + \alpha x + \beta x^3 = \Gamma\cos(\omega t) \]

regarding $\beta$ as a small parameter is constructive. Proceed by letting $\tau = \omega t$ 
which gives 

\[ \omega^2 x'' + k\omega x' + \alpha x + \beta x^3 = \Gamma\cos(\tau), \]

where primes denote differentiation with respect to $\tau$. The perturbation method 
is now applied in which the solution $x(\tau)$ is expanded in a power series in $\beta$ 

\[ x(\tau) = x_0(\tau) + \beta x_1(\tau) + \beta^2 x_2(\tau) + \ldots \]

as is $\omega$ 

\[ \omega = \omega_0 + \beta\omega_1 + \beta^2\omega_2 + \ldots \]



Taking damping $k = 0$ for simplicity, this leads to a series of linear differential 
equations, the first two of which are 

\[ \omega_0^2 x_0'' + \alpha x_0 = \Gamma\cos(\tau) \]

and 

\[ \omega_0^2 x_1'' + \alpha x_1 = -2\omega_0\omega_1 x_0'' - x_0^3. \]

In order for there to be periodic solutions to this last equation we must have 

\[ \frac{\sqrt{\alpha}}{\omega_0} = \frac{1}{3} \quad\text{or}\quad \omega_0 = 3\sqrt{\alpha}. \]

This immediately gives the following: a subharmonic of one third is stimulated 
by applied forcing at three times the natural frequency around $\beta = 0$. The 
presence of these subharmonics is therefore to be expected if the system is well 
approximated by Duffing’s equation and the non-linearity can be regarded as 
a perturbation from the linear. The solution for $x_0(\tau)$ is 

\[ x_0(\tau) = a_{1/3}\cos\frac{1}{3}\tau + b_{1/3}\sin\frac{1}{3}\tau - \frac{\Gamma}{8\alpha}\cos(\tau). \]
The constants $a_{1/3}$ and $b_{1/3}$ are evaluated by demanding that the resonant terms 
in $\cos\frac{1}{3}\tau$ and $\sin\frac{1}{3}\tau$ on the right hand side of the equation for $x_1(\tau)$ 
are zero so that $x_1$ is periodic and free of 
resonance. This gives non-linear equations for these constants, and the 
perturbation solution is built up in this way. It is important in case studies such as this 
to emphasise the practical application. In this case one can discuss oscillations 
of floating platforms due to waves and how the unfortunate presence of waves 
of a certain frequency could excite subharmonics present in the platforms and 
cause the platform to behave violently and even overturn. Stability analysis 
can be effectively employed on this perturbation solution: however there is yet 
a third way to analyse Duffing’s equation. 

First of all Duffing’s equation is transformed into a coupled system of first
order ordinary differential equations by setting $y = \dot{x}$. Hence the undamped
version

$$\ddot{x} + x + \beta x^3 = \Gamma\cos(\omega t)$$

where we have set $\alpha = 1$ becomes

$$\dot{x} = y, \qquad \dot{y} = -x - \beta x^3 + \Gamma\cos\omega t.$$

The method then proceeds using variational arguments. Suppose

$$\xi = x - x^*, \qquad \eta = y - y^*$$

where $(x^*, y^*)$ is the solution whose stability is to be examined,
and we substitute for $x$ and $y$ into the above equations but ignore squares and
higher powers of $\xi$ and $\eta$. Thus

$$\dot{\xi} = \eta, \qquad \dot{\eta} = -(1 + 3\beta x^{*2})\xi$$

as $\dot{x}^* = y^*$ and $\dot{y}^* = -x^* - \beta x^{*3} + \Gamma\cos\omega t$. Periodic solutions of the form

$$x = a\cos\omega t$$

are now sought. In order for this to be a solution, we need

$$\tfrac{3}{4}\beta a^3 + (1 - \omega^2)a = \Gamma.$$

This is obtained by substituting into the original Duffing’s equation for $x$ and
equating coefficients of $\cos\omega t$. Again this is justified as long as $a\cos\omega t$ is an
approximate solution which is in effect the first term of a Fourier series. If we
eliminate $\eta$ from the above pair of equations in $\xi$ and $\eta$, then

$$\ddot{\xi} + (1 + 3\beta x^{*2})\xi = 0.$$
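The amplitude condition for the approximate solution $a\cos\omega t$ is a cubic in $a$, so a student can find its real roots numerically. The following is a minimal sketch only: the helper name `duffing_amplitudes` and the parameter values are illustrative assumptions, not taken from the text.

```python
import numpy as np

def duffing_amplitudes(beta, omega, gamma):
    """Real roots a of (3/4)*beta*a**3 + (1 - omega**2)*a = gamma."""
    # Coefficients of the cubic, highest power first.
    coeffs = [0.75 * beta, 0.0, 1.0 - omega**2, -gamma]
    roots = np.roots(coeffs)
    # Keep only the (numerically) real roots.
    real = roots[np.abs(roots.imag) < 1e-9].real
    return np.sort(real)

# Illustrative values (assumed): weak non-linearity, forcing above resonance.
amps = duffing_amplitudes(beta=0.1, omega=1.5, gamma=0.5)
print(amps)
```

For these values the cubic has three real roots, which gives a natural lead in to discussing which of the corresponding periodic solutions are stable.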







Further we approximate $x^*$ by $a\cos\omega t$ with $a$ given by the root of the cubic
above, to give the following equation for $\xi$:

$$\ddot{\xi} + \left(1 + \tfrac{3}{2}\beta a^2 + \tfrac{3}{2}\beta a^2\cos 2\omega t\right)\xi = 0.$$

This enables the stability of the solution to be analysed. This equation is in
fact a form of Mathieu’s equation which arises naturally from the equation
obeyed by a simple pendulum whose support oscillates vertically. One possible
direction for an extension to this case study is to derive the equation to such a
pendulum in the form

$$a\ddot{\theta} + (g - \ddot{z})\theta = 0$$

where $a$ is the length of the pendulum, $z$ the vertical displacement of its
oscillating support and $\theta$ the small angle the pendulum string makes with
the vertical. As the coefficient of $\theta$ in the above equation is periodic it is Hill’s
equation

$$\ddot{x} + (\alpha + p(t))x = 0$$

which becomes Mathieu’s equation for the case when $p(t) = k\cos\omega t$. This is an
example of parametric excitation which is behind many engineering problems
including the oscillation of the Millennium Bridge over the Thames in London.
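Stability of Hill’s equation with $p(t) = k\cos\omega t$ can be explored numerically through Floquet theory: integrate two independent solutions over one period of the coefficient and examine the trace of the resulting monodromy matrix. The sketch below is one possible approach rather than a definitive one; the function names, the hand written fourth order Runge-Kutta integrator and the step count are all assumptions made for illustration.

```python
import math

def monodromy_trace(alpha, k, omega=1.0, steps=2000):
    """Trace of the monodromy matrix of x'' + (alpha + k*cos(omega*t)) x = 0."""
    period = 2.0 * math.pi / omega
    h = period / steps

    def rhs(t, x, y):
        # First order system: x' = y, y' = -(alpha + k cos(omega t)) x.
        return y, -(alpha + k * math.cos(omega * t)) * x

    def integrate(x, y):
        # Classical fourth order Runge-Kutta over one full period.
        t = 0.0
        for _ in range(steps):
            k1x, k1y = rhs(t, x, y)
            k2x, k2y = rhs(t + h / 2, x + h / 2 * k1x, y + h / 2 * k1y)
            k3x, k3y = rhs(t + h / 2, x + h / 2 * k2x, y + h / 2 * k2y)
            k4x, k4y = rhs(t + h, x + h * k3x, y + h * k3y)
            x += h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
            y += h / 6 * (k1y + 2 * k2y + 2 * k3y + k4y)
            t += h
        return x, y

    x1, _ = integrate(1.0, 0.0)   # solution with x(0) = 1, x'(0) = 0
    _, y2 = integrate(0.0, 1.0)   # solution with x(0) = 0, x'(0) = 1
    return x1 + y2

def is_stable(alpha, k, omega=1.0):
    """Solutions stay bounded when |trace| < 2."""
    return abs(monodromy_trace(alpha, k, omega)) < 2.0

print(is_stable(0.5, 0.0))    # no parametric forcing
print(is_stable(0.25, 0.1))   # forcing near the first resonance
```

Scanning `alpha` and `k` over a grid and shading the unstable points produces the familiar stability diagram for Mathieu’s equation, which makes a good student exercise.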

There are many other possible extensions of this case study. One could be 
the analysis of subharmonic oscillations of floating bodies, including the gen- 
eralisation to n dimensional linear systems (Floquet theory) and the inclusion 
of damping. Another is the analysis of N linked forced pendulums. This has 
recently been examined by D. J. Acheson (see his papers in the Proceedings 
of the Royal Society of London, 1993, 443 239-245, and 1995, 448 89-95.) 
This derives Mathieu’s equation, and relates the stability to parameter ranges. 
This particular approach is elegant but quite straightforward and lends itself to 
investigation by a final year student. Of course, if the degree explicitly contains 
modules on non-linear dynamics and chaos, many more extensions are possible. 



5.4 Traffic Flow 

It may not be immediately obvious, but this case study stems from the study of 
wave motion. Many mathematics degrees have something in them about waves, 
but very few have dedicated courses that are compulsory. This is a common 
problem in introducing case studies: one has to start from a common knowledge 
base. To assume all students know about linear wave theory would discourage 
too many students, but to give a full account of linear wave theory would not 
be possible. A compromise is to give a quick run through of linear waves, in 
particular to introduce the terminology. The simplest description centres on
the sinusoidal wave

$$\phi(x, t) = a\sin(kx - \omega t).$$

This is a wave of amplitude $a$, wave number $k$, angular frequency $\omega$, propagating
along the $x$-axis; $t$ represents time. The frequency is $f = \omega/2\pi$ (in Hertz
or cycles per second, whereas angular frequency is measured in radians per
second). The wavelength is $\lambda = 2\pi/k$, and the period is $1/f$ or $2\pi/\omega$. Waves
have crests and troughs corresponding to the maxima and minima respectively
of the sine wave. Progressive waves retain their form (sinusoidal above, but any
periodic shape repeated indefinitely will do). Such a wave moves with celerity
(alternative names are phase velocity or wave speed) $c = \omega/k$ along the $x$-axis.
The other extreme is a wave that does not progress at all and is a standing
wave. An example with a sinusoidal profile would be

$$\phi(x, t) = a\sin kx\cos\omega t.$$

A standing wave has points (nodes) where there is never any vertical displacement.
For the above wave these occur at $x = n\pi/k$ where $n = 0, \pm1, \pm2, \dots$.
At times $t = (2N + 1)\pi/2\omega$ where $N = 0, \pm1, \pm2, \dots$, the wave is completely
flat. Most waves in reality are somewhere between being entirely progressive and
entirely standing. All waves can be expressed as a train of progressive waves.
The standing wave above is simply two opposing progressive waves:

$$\tfrac{1}{2}\left(a\sin(kx - \omega t) + a\sin(kx + \omega t)\right).$$
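Students who are new to the terminology may find it reassuring to check the decomposition numerically. The short sketch below compares the standing wave with the sum of the two opposing progressive waves and evaluates the displacement at a few nodes; the values chosen for $a$, $k$, $\omega$ and $t$ are arbitrary illustrative assumptions.

```python
import numpy as np

# Arbitrary illustrative values (assumed).
a, k, omega = 1.5, 2.0, 3.0
x = np.linspace(0.0, 2.0 * np.pi, 201)
t = 0.37

standing = a * np.sin(k * x) * np.cos(omega * t)
superposed = 0.5 * (a * np.sin(k * x - omega * t) + a * np.sin(k * x + omega * t))
max_difference = np.max(np.abs(standing - superposed))

# Nodes at x = n*pi/k should show no displacement at any time.
nodes = np.arange(-2, 3) * np.pi / k
node_displacement = a * np.sin(k * nodes) * np.cos(omega * t)

print(max_difference)
print(node_displacement)
```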

The differential equation obeyed by the one-dimensional wave (progressive
or standing or anything in between) is

$$\frac{\partial^2\phi}{\partial x^2} = \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2}.$$

The variety of solutions to this, especially in three dimensional form

$$\nabla^2\phi = \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2},$$

is vast, and the subject of thick textbooks and year long lecture courses.

Linear waves are met in electromagnetism as solutions to Maxwell’s equa- 
tions: they are also an important solution to the equations of elasticity. In this 
guise they help us to understand earthquakes. In water they are solutions to 
the Navier-Stokes’ equations met in rotating form earlier in this chapter. Lin- 
ear water wave theory works well to describe tides, swell waves and other long 
waves where the wavelength is at least double the wave height (wave height is 
an engineering term equal to twice the amplitude of a sine wave) . However an 
underwater earthquake can cause a tsunami (once poorly termed “tidal wave”
in some old newspaper headlines) which can be virtually a wall of water when
it reaches a coast. A tsunami in this situation is not described by linear wave
theory. Non-linear waves exhibit features that are very distinctive, amongst
which are shocks. Perhaps the most familiar shock wave is the bang
that accompanies aircraft that break the sound barrier, while the tidal bore is
another. Believe it or not, the mathematics of shock waves is the same as that which
can be used to describe the propagation of stoppages in dense traffic. Anyone 
who regularly uses the M25 ring road around London in the UK will have 
practical experience of this type of shock wave. This case study describes the 
mathematics required to understand it. 

The crucial property that all linear waves possess is that if $\phi_1$ and $\phi_2$ are
solutions to the wave equation, then so is $a_1\phi_1 + a_2\phi_2$ where $a_1$ and $a_2$ are
constants. This is no longer the case for non-linear waves and new methods
have to be developed to study them. One of the most profitable is the Method
of Characteristics and this will have to be outlined in order to understand traffic
flow modelling. A characteristic curve carries information forward in time from
an initial condition. The general theory belongs squarely in courses on partial 
differential equations, but a few hints for students not familiar with this can 
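run along the following lines. Given a first order differential equation in the
form

$$a\frac{\partial u}{\partial x} + b\frac{\partial u}{\partial y} = c$$

we can write

$$\frac{dx}{dt} = a, \qquad \frac{dy}{dt} = b,$$

then

$$\frac{du}{dt} = \frac{\partial u}{\partial x}\frac{dx}{dt} + \frac{\partial u}{\partial y}\frac{dy}{dt} = c$$

and the family of characteristic curves is given by $u = u(x, y)$ for which the
integrals of these three equations are the parametric form. Although this is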
linear and first order, this method of finding a family of curves or indeed surfaces 
that satisfy a partial differential equation generalises to some higher order non- 
linear equations. It gives a good way of determining solutions, and the phrase 
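“Method of Characteristics” is used for all such methods. Here is an example of
determining a solution to a second order equation with variable coefficients using
the method of characteristics that could form part of an introduction for students.
Consider the equation:

$$t\frac{\partial^2 u}{\partial x^2} + (t + x)\frac{\partial^2 u}{\partial x\,\partial t} + x\frac{\partial^2 u}{\partial t^2} = 0.$$

The initial values of $u$ and its derivatives are known, and the equation thus
enables all derivatives of $u$ to be determined at time $t = 0$. Taylor’s theorem
can thus be used to find a power series solution. Of course, we cannot solve
this equation without knowing these initial conditions: however as

$$u(0, x) = f(x) \quad\text{(say)}$$

and

$$\frac{\partial u}{\partial t}(0, x) = g(x) \quad\text{(say)}$$

we can differentiate these and obtain three equations obeyed by the three second
order partial derivatives of $u$, namely

$$\begin{aligned}
t\frac{\partial^2 u}{\partial x^2} + (t + x)\frac{\partial^2 u}{\partial x\,\partial t} + x\frac{\partial^2 u}{\partial t^2} &= 0 \\
\frac{\partial^2 u}{\partial x^2}\frac{dx}{dt} + \frac{\partial^2 u}{\partial t\,\partial x} &= \frac{\partial f}{\partial t} + \frac{\partial f}{\partial x}\frac{dx}{dt} \\
\frac{\partial^2 u}{\partial t\,\partial x}\frac{dx}{dt} + \frac{\partial^2 u}{\partial t^2} &= \frac{\partial g}{\partial t} + \frac{\partial g}{\partial x}\frac{dx}{dt}.
\end{aligned}$$

The consistency condition for these three equations is that the determinant of
the left hand side is zero. This means that

$$\begin{vmatrix} t & t + x & x \\ \dfrac{dx}{dt} & 1 & 0 \\ 0 & \dfrac{dx}{dt} & 1 \end{vmatrix} = 0$$

or

$$x\left(\frac{dx}{dt}\right)^2 - (x + t)\frac{dx}{dt} + t = 0.$$

The solution of this quadratic equation is

$$\frac{dx}{dt} = \frac{t}{x} \quad\text{and}\quad \frac{dx}{dt} = 1.$$

Integrating these gives the two sets of characteristic curves

$$x^2 - t^2 = \text{const.}$$

and

$$x - t = \text{const.}$$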
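The two characteristic directions can be checked at any chosen point by solving the quadratic numerically. The following is a minimal sketch; the helper name `characteristic_slopes` and the sample point are illustrative assumptions.

```python
import numpy as np

def characteristic_slopes(x, t):
    """Roots dx/dt of x*(dx/dt)**2 - (x + t)*(dx/dt) + t = 0 at the point (x, t)."""
    return np.sort(np.roots([x, -(x + t), t]).real)

# At the sample point x = 2, t = 1 the slopes should be t/x and 1.
slopes = characteristic_slopes(2.0, 1.0)
print(slopes)
```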



There is no need to take this particular example further, but knowledge of 
these characteristic curves enables the equation to be transformed into so-called 
canonical form which in turn enables the solution to be determined. For fully 
non-linear equations, the method of characteristics can still be applied: how- 
ever the characteristics themselves cannot in general be found explicitly. The 
equation will for example usually be implicit and depend on u. Also, charac- 
teristics are only real and distinct for some equations (designated as hyperbolic 
partial differential equations). For equations with variable coefficients, there 
may be regions where the equation flips to being parabolic and then elliptic in 
which case the characteristics become complex and cease to exist in real space. 
All this can be mentioned as an introduction to this case study, and reference
should be made to texts on partial differential equations for further study and 
perhaps some student exercises. 

Turning to traffic flow modelling, we define a single one-dimensional stream
of cars. Assume that a car density $\rho$ can be defined and that the local car
velocity $v(x, t)$ is a function only of this density, i.e. $v = v(\rho)$. This is a candidate
for modification to develop more sophisticated models as it implies that the
speed of any given car is entirely determined by the speed of the car in front. If
$q(x, t)$ is the car flow rate, then over a length of road from $x_1$ to $x_2$ the conservation
of cars implies that

$$\frac{d}{dt}\int_{x_1}^{x_2}\rho(x, t)\,dx = q(x_1) - q(x_2).$$

Taking the limit as $(x_1 - x_2) \to 0$ we get

$$\frac{\partial\rho}{\partial t} + \frac{\partial q}{\partial x} = 0$$

which is the one-dimensional version of the continuity equation familiar to
those who study fluid mechanics. A certain amount of discussion can be based
around the consequences of this equation. For example, given that $q = \rho v$,
stationary traffic $v = 0$ imposes an upper limit on the density of cars. Also as
density increases, drivers must slow down. Both of these are reasonable, but
the model does have shortcomings and does not predict the observation that
the maximum flow rate in heavy traffic tends to occur when cars are travelling
at moderate speeds. Go too quickly and a traffic jam results. A traffic jam
where the cars are stationary can be analysed by using wave theory coupled
with characteristics. The outline here owes much to the excellent account given
in Wave Motion by J. Billingham and A. C. King, published by Cambridge
University Press in 2000. The problem is expressed mathematically as follows:



$$\frac{\partial\rho}{\partial t} + c(\rho)\frac{\partial\rho}{\partial x} = 0$$

where

$$c(\rho) = \frac{dq}{d\rho} = v(\rho) + \rho v'(\rho)$$

is the kinematic wave speed. This name stems from the solution for small
variations in density for which the problem is linear. In this case, the system
is perturbed from a state $\rho = \rho_0$ so that $\rho = \rho_0 + \tilde\rho$ where $\tilde\rho \ll 1$. The above
equation then approximates to the one-dimensional linear equation

$$\frac{\partial\tilde\rho}{\partial t} + c(\rho_0)\frac{\partial\tilde\rho}{\partial x} = 0$$

which has a general solution:

$$\tilde\rho = f(x - c(\rho_0)t)$$

representing a wave propagating with speed (celerity) $c(\rho_0)$. This is rather
idealised, but it does tell us that should there be a maximum value of the flux
$q(\rho)$ at $\rho = \rho^*$ say, then $c > 0$ for $\rho < \rho^*$ but $c < 0$ for $\rho > \rho^*$. This is
the phenomenon of the traffic wave travelling in the opposite direction to the
flow and corresponds to the traffic ahead stopping for no apparent reason and
this stopped traffic backing up along the road. Curves upon which the traffic
density is constant are the characteristic curves for this problem in $x$-$t$ space.
For this linear approximation, the characteristic curves are simply the straight
lines

$$x = x_0 + c(\rho_0)t$$

and these are shown in Figure 5.2.

Figure 5.2 The characteristics for the linear approximation.

The non-linear problem brings out the value of using characteristics. In order
to focus a case study, it is a good idea to examine a specific problem. Let us
set as car velocity

$$v(\rho) = \frac{v_0}{\rho_{\max}}(\rho_{\max} - \rho)$$

from which the celerity is

$$c(\rho) = \frac{dq}{d\rho} = \frac{v_0}{\rho_{\max}}(\rho_{\max} - 2\rho),$$

and a specific functional form for the initial car density is chosen as follows:

$$\rho_0(x) = \frac{\rho_L + \rho_R e^{x/L}}{1 + e^{x/L}}$$

where $\rho_L$, $\rho_R$ and $L$ are positive constants. This is a representation of density
changing gradually from $\rho_L$ for large but negative $x$ to $\rho_R$ for large and positive
$x$ crossing the value $\rho_0 = \tfrac{1}{2}(\rho_L + \rho_R)$ at $x = 0$. The case where $\rho_L > \rho_R$, which
will always be assumed here, is shown in Figure 5.3.

Figure 5.3 The initial conditions for car density $\rho(x, 0) = \rho_0(x)$.

Now, we are considering the non-linear problem:

$$\frac{\partial\rho}{\partial t} + c(\rho)\frac{\partial\rho}{\partial x} = 0$$

subject to

$$\rho(x, 0) = \rho_0(x)$$
which has been assigned above. The task is to find a set of characteristics upon
which to base the solution to the initial value problem. As the problem is
driven by the initial conditions, from the properties of characteristics outlined
above, we seek a solution $\rho(x, t)$ as follows. Look for a characteristic curve
$x = X(t)$ for which

$$\rho(X(t), t) = \rho(X(0), 0) = \rho_0(X(0)).$$

That is, differentiating as in the above example on characteristics,

$$\frac{\partial\rho}{\partial t} + \frac{dX}{dt}\frac{\partial\rho}{\partial x} = 0.$$

Having defined the celerity $c(\rho)$ through

$$\frac{d}{dt}\rho(X(t), t) = \frac{\partial\rho}{\partial t} + c(\rho)\frac{\partial\rho}{\partial x}$$

it is clear that we require

$$\frac{dX}{dt} = c(\rho).$$

By construction, $\rho$ is a constant on each characteristic, therefore so is $c(\rho)$.
Hence the above equation implies that, even in this non-linear case, each
characteristic remains a straight line and takes the form

$$x = X(t) = x_0 + c(\rho_0(x_0))\,t \quad\text{for } -\infty < x_0 < \infty.$$

Inserting the chosen form for $\rho_0(x)$ into these expressions gives explicit formulae
for the characteristics. Several student exercises suggest themselves to the lecturer
at this point. First of all, students should be able to draw the graph of $c(\rho)$ as
a function of $x$. Next, the characteristic curves (straight lines) should be drawn
on an $x$-$t$ plot. They will take the form of a “fan”. Students could be asked to
explore numerical solutions to the initial value problem too. The following
solution to the problem is well worth exploring as it gives several insights:

$$\rho(x, t) = \begin{cases}
\rho_L & \text{for } x < c(\rho_L)t \\
\tfrac{1}{2}\rho_{\max}\left(1 - \dfrac{x}{v_0 t}\right) & \text{for } c(\rho_L)t < x < c(\rho_R)t \\
\rho_R & \text{for } x > c(\rho_R)t.
\end{cases}$$

For example, with $\rho_L = \rho_{\max}$ and $\rho_R = 0$ cars pulling away from a queue at
traffic lights as they turn green is simulated. The “shock”, that is the interface
between stationary and moving cars, propagates back through the queue with
celerity $c(\rho_{\max}) = -v_0$. The general solution curves for $\rho(x, t)$ should be drawn
against $x - c(\rho_0(0))t$, and the gradual dissipation of the front at $x = c(\rho_0(0))t$ with
time noted. Other generalisations of a mathematical nature are possible (see the
book by Billingham and King referred to earlier) as well as observations such
as if a traffic jam is approached at a slow enough speed it will always vanish,
which is obvious after a little thought. Conversely, fast driving in dense traffic
causes jams, also obvious although you wouldn’t think so given the evidence!
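The exercises suggested above can be started from a few short functions. The sketch below codes the chosen velocity law, the celerity, the smooth initial density, the straight line characteristics and the fan solution; the constants `V0` and `RHO_MAX` are set to one purely for illustration, and all function names are assumptions rather than anything prescribed by the text. The initial density is written in the algebraically equivalent form $\rho_R + (\rho_L - \rho_R)/(1 + e^{x/L})$.

```python
import numpy as np

V0, RHO_MAX = 1.0, 1.0    # illustrative values (assumed)

def velocity(rho):
    return V0 * (RHO_MAX - rho) / RHO_MAX

def flux(rho):
    return rho * velocity(rho)

def celerity(rho):
    # c(rho) = dq/drho for the velocity law above.
    return V0 * (RHO_MAX - 2.0 * rho) / RHO_MAX

def initial_density(x, rho_l, rho_r, length):
    # Same function as (rho_l + rho_r*e)/(1 + e) with e = exp(x/length).
    return rho_r + (rho_l - rho_r) / (1.0 + np.exp(x / length))

def characteristic(x0, t, rho_l, rho_r, length):
    """Position at time t of the characteristic that starts at x0."""
    return x0 + celerity(initial_density(x0, rho_l, rho_r, length)) * t

def fan_density(x, t, rho_l, rho_r):
    """Piecewise fan solution quoted in the text (for t > 0)."""
    x = np.asarray(x, dtype=float)
    middle = 0.5 * RHO_MAX * (1.0 - x / (V0 * t))
    return np.where(x < celerity(rho_l) * t, rho_l,
                    np.where(x > celerity(rho_r) * t, rho_r, middle))

# Queue released at a green light: full density behind, empty road ahead.
print(fan_density([-2.0, -0.5, 0.0, 0.5, 2.0], 1.0, RHO_MAX, 0.0))
```

Plotting `characteristic(x0, t, ...)` for a range of starting points `x0` gives the fan of straight lines directly.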

Students can be asked to take this case study further. If the taught part 
of their course contains the theory of characteristics, then they should be able 
to analyse the characteristics defined here more fully. For example, investigate 
what is represented by their intersection and interpret this in terms of shock 
waves. In turn, this can be interpreted in terms of the behaviour of real traffic, 
and when it cannot students can investigate which of the initial assumptions 
are violated. Students can also investigate alternative functional forms for the
traffic density $\rho$.
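For students asked to explore numerical solutions, one simple starting point is the Lax-Friedrichs finite difference scheme applied to the conservation form of the traffic equation. This is a sketch under stated assumptions only: the scheme is a choice made here for illustration (it is not taken from the text), the constants and grid are arbitrary, and the scheme is known to smear sharp fronts.

```python
import numpy as np

V0, RHO_MAX = 1.0, 1.0          # illustrative values (assumed)

def flux(rho):
    # q = rho * v(rho) with v = V0 * (1 - rho/RHO_MAX).
    return V0 * rho * (1.0 - rho / RHO_MAX)

def lax_friedrichs(rho, dx, dt, steps):
    """Advance rho_t + q(rho)_x = 0, holding the two end values fixed."""
    rho = rho.copy()
    for _ in range(steps):
        q = flux(rho)
        new = rho.copy()
        new[1:-1] = 0.5 * (rho[2:] + rho[:-2]) - 0.5 * dt / dx * (q[2:] - q[:-2])
        rho = new
    return rho

x = np.linspace(-4.0, 4.0, 401)
dx = x[1] - x[0]
dt = 0.5 * dx / V0               # Courant number 0.5, since |c| <= V0
rho_start = np.where(x < 0.0, RHO_MAX, 0.0)               # queue at a red light
rho_end = lax_friedrichs(rho_start, dx, dt, steps=100)    # final time 100*dt = 1
print(rho_end[::50])
```

Comparing `rho_end` with the fan solution at the same time shows both the agreement in the middle of the fan and the numerical smoothing at its edges, which is itself worth discussing.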



5.5 Contour Integral Solutions to ODEs 



Back in the mists of time, a lecture course on complex variables formed a 
compulsory part of the first two years of any self respecting mathematics de- 
gree. This is no longer the case, and it is not only possible but commonplace 
for mathematics students to graduate knowing no complex variable theory at 
all, and little in the way of related matter beyond the arithmetic of complex 
numbers. Such students are denied the use of the power of complex variables. 
A particularly strong use is in finding solutions to both ordinary and partial 
differential equations and it is this that forms this case study. 

As a prerequisite, students should know how to solve both ordinary and 
partial differential equations. Solution methods such as the Frobenius method, 
variation of parameters and complementary function and integrating factor 
techniques for ordinary differential equations, and the use of separation of vari- 
ables should be included. Some knowledge of Laplace transforms would also be 
useful, but not essential. 

As this case study concentrates on finding contour integral solutions to certain
types of ordinary differential equation, a knowledge of complex variable
theory up to the definition of integration and singularities is essential. In fact if
the students have not had the opportunity to study a unit on complex variable 
theory, this case study is unsuitable. 

To begin in the most general way, consider the ordinary differential equation
of nth order but with linear coefficients. This is sometimes referred to as
Laplace’s linear equation, and it is written in the form

$$(a_n z + b_n)\frac{d^n w}{dz^n} + (a_{n-1}z + b_{n-1})\frac{d^{n-1}w}{dz^{n-1}} + \dots + (a_0 z + b_0)w = 0,$$

where $a_p$ and $b_p$ ($p = 0, 1, \dots, n$) are (complex) constants, $z = x + iy$ is the
standard complex variable and $w = w(z)$ is a function of the complex variable $z$.
The key point here is that with linear coefficients, an integral transform technique
works. If the coefficients contain quadratic or higher powers of $z$ there is no
guarantee that the method will work, though it might as we will see in the last
example in this section. Those students familiar with the technique of using Laplace
transforms for solving ordinary differential equations (see P. P. G. Dyke’s
Introduction to Laplace Transforms and Fourier Series, published by Springer-Verlag
in 2001, chapter 3) will have a head start. Those for whom the technique is new will
need a little more time. The idea is to set

$$w = \int_C f(z, \zeta)\,d\zeta$$

where $f(z, \zeta)$ is a complex valued function of both $z$ and $\zeta$. The behaviour of
$f$ in the $\zeta$ plane (for fixed $z$) is such that it has a finite number of isolated
singularities in the form of poles and/or branch points but not essential
singularities. This means that along any particular branch of the contour $C$ if the
function $f(z, \zeta)$ contains the factor $(\zeta - a)^k$ (real $k$) then although ostensibly
this is many valued a value may be assumed as long as we remain on a chosen
branch of the function. There are many concepts here: singularity, pole, branch
point, essential singularity. It is therefore reinforced that it is a good idea for
this case study to take place after a reasonable grounding in complex variable
theory.

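The way to proceed is already hinted at with the mention of Laplace Transforms.
We try setting

$$f(z, \zeta) = e^{z\zeta}P(\zeta)$$

so that

$$w = \int_C e^{z\zeta}P(\zeta)\,d\zeta.$$

First of all, since it is legitimate to exchange derivative and integral operations
for this integrand, we note that

$$b_p\frac{d^p w}{dz^p} = b_p\int_C \zeta^p e^{z\zeta}P(\zeta)\,d\zeta,$$

for $p = 1, 2, \dots, n$. This is the “Laplace transform trick”. The terms in $a_p$ need
more careful attention as follows:

$$\begin{aligned}
a_p z\frac{d^p w}{dz^p} &= a_p z\frac{d^p}{dz^p}\int_C e^{z\zeta}P(\zeta)\,d\zeta \\
&= a_p\int_C z\zeta^p e^{z\zeta}P(\zeta)\,d\zeta \\
&= a_p\left[\zeta^p e^{z\zeta}P(\zeta)\right]_C - a_p\int_C e^{z\zeta}\frac{d}{d\zeta}\left(\zeta^p P(\zeta)\right)d\zeta.
\end{aligned}$$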
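Thus in order to satisfy Laplace’s linear equation we require that

$$\sum_{p=0}^{n}\left\{a_p\left[\zeta^p e^{z\zeta}P(\zeta)\right]_C + \int_C e^{z\zeta}\left(b_p\zeta^p P(\zeta) - a_p\frac{d}{d\zeta}\left(\zeta^p P(\zeta)\right)\right)d\zeta\right\} = 0.$$

This can be written

$$\left[A(\zeta)P(\zeta)e^{z\zeta}\right]_C + \int_C\left[B(\zeta)P(\zeta) - \frac{d}{d\zeta}\left(A(\zeta)P(\zeta)\right)\right]e^{z\zeta}\,d\zeta = 0$$

where

$$A(\zeta) = \sum_{p=0}^{n} a_p\zeta^p \quad\text{and}\quad B(\zeta) = \sum_{p=0}^{n} b_p\zeta^p.$$

This is satisfied by setting

$$B(\zeta)P(\zeta) = \frac{d}{d\zeta}\left(A(\zeta)P(\zeta)\right)$$

and by demanding that

$$\left[A(\zeta)P(\zeta)e^{z\zeta}\right]_C = 0.$$

The first order differential equation for $P(\zeta)$ that results from the first of these
demands leads to $P(\zeta)$ taking the form:

$$P(\zeta) = e^{k_0\zeta}(\zeta - \alpha_1)^{k_1}\cdots(\zeta - \alpha_n)^{k_n}$$

where $\alpha_1, \dots, \alpha_n$ are the zeros of the polynomial $A(\zeta)$ provided they are all
different. The requirement that the integrated part vanishes takes the form

$$\left[e^{(z + k_0)\zeta}(\zeta - \alpha_1)^{k_1 + 1}\cdots(\zeta - \alpha_n)^{k_n + 1}\right]_C = 0$$

which leads to choices for the contour $C$ which are both interesting and
informative. This general theory is a good exercise for the students in the use
and interpretation of complex contour integration. However specific examples
should be done in class in order to find specific $f(z, \zeta)$s and specific contours.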
A good choice is to solve Airy’s differential equation. This equation occurs in 
several branches of applied mathematics where an oscillatory solution in one 
part of the solution domain gives way to an exponential solution in another. 
Early models of the steady currents that inhabit equatorial ocean regions are 
one example. Airy’s ordinary differential equation takes the form 



w" = zw. 
Let

$$w = \int_C e^{z\zeta}P(\zeta)\,d\zeta$$

so on differentiating twice with respect to $z$

$$w'' = \int_C \zeta^2 e^{z\zeta}P(\zeta)\,d\zeta$$

and

$$zw = \int_C z e^{z\zeta}P(\zeta)\,d\zeta = \left[e^{z\zeta}P(\zeta)\right]_C - \int_C e^{z\zeta}P'(\zeta)\,d\zeta.$$

So Airy’s equation $w'' = zw$ implies

$$\left[e^{z\zeta}P(\zeta)\right]_C - \int_C\left(\zeta^2 P(\zeta) + P'(\zeta)\right)e^{z\zeta}\,d\zeta = 0.$$

To satisfy this, we first choose the function $P(\zeta)$ to satisfy an equation that
makes the second expression vanish, viz.

$$-\zeta^2 P(\zeta) = P'(\zeta)$$

from which integration (ignoring the arbitrary constant: we only want one
solution) gives

$$P(\zeta) = e^{-\frac{1}{3}\zeta^3}$$

together with demanding that

$$\left[e^{z\zeta}P(\zeta)\right]_C = \left[e^{z\zeta - \frac{1}{3}\zeta^3}\right]_C = 0.$$

Now, the $-\tfrac{1}{3}\zeta^3$ part will dominate far from the origin. Also, writing $\zeta = re^{i\theta}$
we have

$$-\tfrac{1}{3}\zeta^3 = -\tfrac{1}{3}r^3\cos 3\theta - \tfrac{i}{3}r^3\sin 3\theta$$

and the real part of this is negative provided

$$\cos 3\theta > 0.$$
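A numerical check makes the role of the contour concrete. The sketch below assumes one particular contour, made of two straight rays at $\arg\zeta = \mp 2\pi/3$ (both lie where $\cos 3\theta > 0$), truncates each ray at a finite length, and uses the trapezium rule; the function name, the truncation and the number of points are all illustrative assumptions. It then tests whether the resulting $w$ satisfies $w'' = zw$, and compares $w(0)/2\pi i$ with the known value of the Airy function at the origin, approximately 0.355028.

```python
import numpy as np

def airy_contour(z, power=0, s_max=8.0, n=20001):
    """Integrate zeta**power * exp(z*zeta - zeta**3/3) along two rays.

    The contour comes in from infinity along arg(zeta) = -2*pi/3 and
    leaves along arg(zeta) = +2*pi/3, so the integrand decays at both ends.
    """
    s = np.linspace(0.0, s_max, n)
    total = 0.0j
    for sign in (+1.0, -1.0):
        direction = np.exp(sign * 2j * np.pi / 3)
        zeta = s * direction
        weight = np.ones_like(zeta) if power == 0 else zeta**power
        integrand = weight * np.exp(z * zeta - zeta**3 / 3) * direction
        # Trapezium rule along the ray, measured outwards from the origin.
        ray = np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(s))
        total += sign * ray      # the incoming ray is traversed inwards
    return total

z = 1.0
w = airy_contour(z, power=0)        # w(z)
w_dd = airy_contour(z, power=2)     # w''(z), from differentiating under the integral
print(abs(w_dd - z * w))
print((airy_contour(0.0) / (2j * np.pi)).real)
```

Repeating the calculation with a different pair of sectors gives a second, independent solution for comparison.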
Figure 5.4 One solution contour for Airy’s equation.

Contours that begin and end at infinity inside the sectors where $\cos 3\theta > 0$
therefore satisfy this condition, and there are two
(but not three) independent contours, that is contours that are not deformable
into each other. This illustrates that there are indeed two independent solutions
to this second order differential equation. One possible extension to set the
students is to solve the equation

$$w''' = zw$$

or perhaps the generalised Airy equation

$$\frac{d^n w}{dz^n} = zw.$$

Neither of these would tax the good student. There are other equations one
could tackle by this method, for example the Legendre equation:

$$(1 - z^2)\frac{d^2 w}{dz^2} - 2z\frac{dw}{dz} + n(n + 1)w = 0.$$

This is not a member of Laplace’s linear ordinary differential equations:
nevertheless a contour integral solution can be obtained as follows. A power series
solution can be generated in the standard form

$$w = z^{-\alpha}\sum_{r=0}^{\infty} c_r z^{-r}.$$

This is because $z = \infty$ is one of the three regular singularities of this differential
equation (the others are at $z = \pm 1$). Standard series solution methods are used
to find the $c_r$ and these may need to be revised. A polynomial solution is
possible provided $n$ is an integer. It has the form

$$w = A\left\{z^n - \frac{n(n-1)}{2(2n-1)}z^{n-2} + \frac{n(n-1)(n-2)(n-3)}{2\cdot 4\cdot(2n-1)(2n-3)}z^{n-4} - \dots\right\}.$$

Taking the constant $A$ to have the value

$$\frac{(2n)!}{2^n(n!)^2}$$

in turn leads to Rodrigues’ formula for the solution of the Legendre equation

$$P_n(z) = \frac{1}{2^n n!}\frac{d^n}{dz^n}(z^2 - 1)^n.$$



This is a neat way of writing the polynomial solution which students may have
met before. If they have, so much the better. These days, the next bit will
probably be new. The Cauchy integral formula

$$\frac{d^n}{dz^n}f(z) = \frac{n!}{2\pi i}\int_C\frac{f(\zeta)}{(\zeta - z)^{n+1}}\,d\zeta$$

is introduced. Putting

$$f(z) = \frac{(z^2 - 1)^n}{2^n n!}$$

we get the contour integral solution

$$P_n(z) = \frac{1}{2\pi i}\int_C\frac{(\zeta^2 - 1)^n}{2^n(\zeta - z)^{n+1}}\,d\zeta.$$

Such integrals are called Schläfli integrals. Having derived this using $n$ as an
integer we can allow $n$ to be completely general and rename it $\nu$ to emphasise
this. The Schläfli integral remains a solution as can be seen by inserting the
integral

$$w = \frac{1}{2\pi i}\int_C\frac{(\zeta^2 - 1)^\nu}{2^\nu(\zeta - z)^{\nu+1}}\,d\zeta$$

into Legendre’s equation

$$(1 - z^2)\frac{d^2 w}{dz^2} - 2z\frac{dw}{dz} + \nu(\nu + 1)w = 0$$

but we need to find $C$ such that

$$\frac{\nu + 1}{2\pi i\,2^\nu}\left[\frac{(\zeta^2 - 1)^{\nu+1}}{(\zeta - z)^{\nu+2}}\right]_C = 0.$$
Of course, since now $\nu$ is no longer an integer, the three points $\zeta = 1$, $\zeta = -1$
and $\zeta = z$ are all branch points so this condition is no longer automatically
satisfied. However, by choosing the contour $C$ with enough care, solutions to
Legendre’s differential equation are obtained. A simple closed contour containing
the points $\zeta = 1$, $\zeta = z$ but not $\zeta = -1$ gives one solution. This works
because $(\nu + 1) - (\nu + 2)$ is an integer. Another possible choice for $C$ is a figure
of eight around $\zeta = 1$ and $\zeta = -1$ not containing the point $\zeta = z$. Again such
a contour works since $\zeta = 1$ and $\zeta = -1$ are circled in opposite senses which
means that the value of

$$\frac{(\zeta^2 - 1)^{\nu+1}}{(\zeta - z)^{\nu+2}}$$

does not change as the contour is traversed. Additionally the two contours
are not deformable into each other; hence we have two independent solutions.
There are no others.
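For integer $n$ the Schläfli integral can be checked numerically against the Legendre polynomial itself. The sketch below takes $C$ to be a circle centred on $z$ (an assumption made for convenience: for integer $n$ the only singularity of the integrand is the pole at $\zeta = z$), approximates the integral with equally spaced points, and compares the result with NumPy’s `legval`. The function name `schlafli`, the radius and the number of points are illustrative choices.

```python
import numpy as np
from numpy.polynomial import legendre

def schlafli(n, z, radius=0.5, m=400):
    """Approximate (1/(2*pi*i)) * integral of
    (zeta**2 - 1)**n / (2**n * (zeta - z)**(n + 1)) around a circle centred on z."""
    theta = 2.0 * np.pi * np.arange(m) / m
    zeta = z + radius * np.exp(1j * theta)
    dzeta = 1j * radius * np.exp(1j * theta) * (2.0 * np.pi / m)
    integrand = (zeta**2 - 1.0)**n / (2.0**n * (zeta - z)**(n + 1))
    return np.sum(integrand * dzeta) / (2j * np.pi)

# Compare with P_3(0.3) = (5*0.3**3 - 3*0.3)/2.
approx = schlafli(3, 0.3)
exact = legendre.legval(0.3, [0, 0, 0, 1])
print(approx, exact)
```

The non-integer case is a harder exercise, since the contour then has to respect the branch cuts described above.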

The students could be given Laguerre’s equation, Bessel’s equation or some 
other equation with a polynomial solution and be required to go through similar 
steps as above to obtain a Schläfli integral solution. The classic text: A Course
of Modern Analysis by E. T. Whittaker and G. N. Watson although old (1902)
is a rich source for examples. 

This case study will only be accessible to students with a reasonable knowl- 
edge of complex variable theory. However, it does show the power of complex 
variables to generate solutions to ordinary differential equations. Other direc- 
tions to pursue are the relationship of Schläfli integrals to generating functions,
and the use of similar methods to solve partial differential equations. This latter
topic is probably more suited to MMath, MSci or other advanced mathematics 
courses. 



5.6 Optimisation 

For the past thirty five years undergraduate degrees in mathematics have con- 
tained some numerical analysis or at least some numerical methods. This of 
course reflects the developments in computing. These days, there are sophis- 
ticated software packages that enable students to pursue many different kinds 
of numerical mathematics. Software packages such as MAPLE are designed to minimise
the amount of computer programming, an unpopular topic with students (es- 
pecially computer science students oddly enough). This section gives one area 
that students can explore using numerical mathematics. 

Optimisation is a large and expanding area of mathematics concerned with 
finding the “best” . This might be the maximum value of profit, the minimum 
value of cost, the best route through, the procedure that gets there fastest,
etc. Mathematically all of these require finding a maximum or minimum of a
function, and this in turn requires finding the roots (solutions) of an equation.
In one dimension this takes the form

$$f(x) = 0.$$

Most students will know more than one method for solving such an equation
numerically. Perhaps the most common is the Newton-Raphson method. This
and other cruder methods such as bisection are met in the first year. Indeed,
solving $f(x) = 0$ numerically is a popular vehicle for coursework in ‘A’ level
(A2) Mathematics taken during the last year at school. In many dimensions 
this optimisation problem generalises to 



$$\begin{aligned}
f_1(x_1, x_2, \dots, x_n) &= 0 \\
f_2(x_1, x_2, \dots, x_n) &= 0 \\
f_3(x_1, x_2, \dots, x_n) &= 0 \\
&\;\;\vdots \\
f_n(x_1, x_2, \dots, x_n) &= 0,
\end{aligned}$$

where $n$ is an integer, typically 2 or 3 for student problems. In optimisation the
$n$ different functions that are zero arise out of equating the $n$ partial derivatives
of a single function $f(x_1, x_2, \dots, x_n)$ to zero. The basic idea of most numerical
methods for solving such a system of equations is to make an informed guess and
use properties of the functions to improve on this guess. If the guess is arbitrary,
then one runs the risk of being so far away from the optimal solution required
that no matter how sophisticated the routine, it is never reached. On the other
hand, how do we make an informed guess? That is one question. Another is
exactly what properties of the functions $f_k(x_1, x_2, \dots, x_n)$, $k = 1, 2, \dots, n$ does
one use? These are the questions tackled in this case study. It should be men- 
tioned here that if these functions are linear, then we are in the realm of linear 
programming. Linear programming is often taught within operational research 
which in turn is often thought of as a separate subject. “Mathematics, Statis- 
tics and Operational Research” is the title of some departments. In the United 
Kingdom, the QAA (Quality Assurance Agency) the national body that is des- 
ignated as the guardian of the quality of UK higher education, also uses the 
title “Mathematics, Statistics and Operational Research” to cover mathematics 
and allied subjects. This title emphasises the separate nature of mathematics, 
statistics and operational research to some minds. Operational Research (or 
OR for short) really started life in the Second World War and has grown along 
with the ability of computers to solve systems with large arrays. The most use- 
ful problems involve maximising profit functions or minimising cost functions. 
This is done by analysing the real problem and devising a number of inequali-
ties obeyed by the variables. The commonest technique is the simplex method, 
and add-ons such as the use of hidden variables and integer programming are 
possible. This case study avoids linear programming, and concentrates on the 
cases where the functions are non-linear. It might be good to tackle a real problem
involving specific choices of function that represent reality; on the other
hand a more general approach means that the students can be introduced to
extreme cases where pathological functions can be introduced to highlight the
shortcomings of certain methods (and certain software!). The choice depends
largely on student clientele and lecturer preference. Here, we shall be going
down the more theoretical route as this case study is principally for mathe- 
matics undergraduates. Just in case there are students who have forgotten the 
Newton-Raphson method, it is a good idea to spend a few minutes on restating 
it. To solve the equation $f(x) = 0$ try a value $x_0$, then use the algorithm

$$x_k = x_{k-1} - \frac{f(x_{k-1})}{f'(x_{k-1})}, \qquad k = 1, 2, \dots$$



to improve the guess. It is a good idea to draw the picture shown in Figure
5.5 which helps by showing several important features. First of all, if the first
guess is poor, the slope of the function might make the next guess worse, or if
the tangent to the curve is too close to the horizontal then $f'(x)$ is close to zero
and the method fails. It is a good method once the first guess is appropriate.
This gets the students in the right frame of mind for numerical work which is
essentially practical. Do not go slavishly following a method which is taking
ages to converge or is not working as it should. Try another “first guess” and
start again! The Newton-Raphson method can be done for a cubic, say, or a
function such as $f(x) = x\sin x - 4\cos x$, set up on MAPLE or even a
spreadsheet such as Excel, and animated as a class demonstration with a press
of the return key getting a better and better approximation. The poor first
guess can be similarly demonstrated. (A cubic with three real roots works well
here: try to get the middle root!)
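The class demonstration described above can also be sketched in a few lines of Python. This is only an illustration: the function name `newton_raphson`, the first guess 1.0, the tolerance and the iteration limit are assumptions made here, not part of the original case study.

```python
import math


def newton_raphson(f, df, x0, tol=1e-10, max_iter=50):
    """Iterate x_k = x_{k-1} - f(x_{k-1}) / f'(x_{k-1}) from the first guess x0."""
    x = x0
    history = [x]
    for _ in range(max_iter):
        slope = df(x)
        if abs(slope) < 1e-14:
            # Tangent almost horizontal: the method fails, as noted in the text.
            raise ZeroDivisionError("f'(x) is close to zero; try another first guess")
        x_new = x - f(x) / slope
        history.append(x_new)
        if abs(x_new - x) < tol:
            return x_new, history
        x = x_new
    raise RuntimeError("not converging; try another first guess")


def f(x):
    return x * math.sin(x) - 4.0 * math.cos(x)


def df(x):
    # d/dx (x sin x - 4 cos x) = sin x + x cos x + 4 sin x
    return 5.0 * math.sin(x) + x * math.cos(x)


root, history = newton_raphson(f, df, x0=1.0)
for k, xk in enumerate(history):
    print(k, xk)
```

Changing `x0` shows how strongly the behaviour depends on the first guess, which is the point of the demonstration.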

Figure 5.5 The Newton-Raphson technique.

Moving on to functions of many variables, one obvious question is does the
Newton-Raphson method generalise? The answer is yes, but it is not quite as
useful. One of the most useful generalisations is the DFP (Davidon-Fletcher-
Powell) method which we will meet later. First of all, as we have said,
optimisation involves setting to zero the n partial derivatives of a function
of n variables and solving the resulting equations:

$$\begin{aligned}
f_{x_1}(x_1, x_2, \ldots, x_n) &= 0 \\
f_{x_2}(x_1, x_2, \ldots, x_n) &= 0 \\
f_{x_3}(x_1, x_2, \ldots, x_n) &= 0 \\
&\;\;\vdots \\
f_{x_n}(x_1, x_2, \ldots, x_n) &= 0
\end{aligned}$$

where the notation

$$f_{x_k} = \frac{\partial f}{\partial x_k}, \qquad k = 1, 2, \ldots, n$$

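has been adopted for convenience. It is neater to use vector notation as follows:
write

$$\begin{aligned}
\mathbf{x} &= (x_1, x_2, \ldots, x_n)^T \\
\mathbf{a} &= (a_1, a_2, \ldots, a_n)^T \\
\delta\mathbf{a} &= (\delta a_1, \delta a_2, \ldots, \delta a_n)^T \\
\nabla &= \left(\frac{\partial}{\partial x_1}, \frac{\partial}{\partial x_2}, \ldots, \frac{\partial}{\partial x_n}\right)^T
\end{aligned}$$

and the Hessian matrix of second order partial derivatives is defined as

$$G = \begin{pmatrix}
\dfrac{\partial^2 f}{\partial x_1^2} & \dfrac{\partial^2 f}{\partial x_1 \partial x_2} & \cdots & \dfrac{\partial^2 f}{\partial x_1 \partial x_n} \\
\dfrac{\partial^2 f}{\partial x_2 \partial x_1} & \dfrac{\partial^2 f}{\partial x_2^2} & \cdots & \dfrac{\partial^2 f}{\partial x_2 \partial x_n} \\
\vdots & \vdots & \ddots & \vdots \\
\dfrac{\partial^2 f}{\partial x_n \partial x_1} & \dfrac{\partial^2 f}{\partial x_n \partial x_2} & \cdots & \dfrac{\partial^2 f}{\partial x_n^2}
\end{pmatrix}_{\mathbf{x} = \mathbf{a}}$$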



Taylor’s theorem in n variables, truncated after the second order terms, is now
written as

$$f(\mathbf{x}) \approx f(\mathbf{a}) + \delta\mathbf{a}^T \nabla f(\mathbf{a}) + \tfrac{1}{2}\,\delta\mathbf{a}^T G\,\delta\mathbf{a},$$

which approximates the function $f(\mathbf{x})$ near the point $\mathbf{a}$, with
$\delta\mathbf{a} = \mathbf{x} - \mathbf{a}$. The direct analogy to
the Newton-Raphson method for this n variable function would be the iterative
scheme

$$\mathbf{x}_k = \mathbf{x}_{k-1} - [G(\mathbf{x}_{k-1})]^{-1} \nabla f(\mathbf{x}_{k-1}).$$

The problem is having to invert the Hessian matrix $G(\mathbf{x}_{k-1})$ at each step
which, although theoretically possible, makes for a very time consuming and
lengthy process. It is also necessary to calculate second order derivatives. Are
such complications really necessary? It is a good idea to discuss what the real
problem is with the students before plunging into involved calculations. The
gradient of the function $\nabla f(\mathbf{x})$ gives a direction of steepest ascent and descent
in n dimensional space. An analogy with walking over hills and down dales in
two dimensions is always useful here. Therefore it might be possible to use this
fact to home in on maxima or minima. Such methods, called methods of steepest
descent, do not involve inverting matrices, and only the first derivatives of $f(\mathbf{x})$
need to be found. Students will ask what’s the catch! The “catch” is that the
method can be very slow. Although the calculations are more straightforward,
there may be a lot of them. The basic iterative scheme is



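1. choose $\mathbf{a}_k$

2. calculate $f(\mathbf{a}_k)$

3. calculate $\nabla f$ and evaluate at $\mathbf{x} = \mathbf{a}_k$. Call this $\mathbf{h}_k$.

4. form $\mathbf{x} = \mathbf{a}_k - \lambda\mathbf{h}_k$

5. find the value of the single variable $\lambda$ that minimises $F(\lambda) = f(\mathbf{a}_k - \lambda\mathbf{h}_k)$.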
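The five steps above can be sketched as follows. The test function $x_1^2 - x_1x_2 + 3x_2^2$, the start point (1, 1) and the closed-form step length are assumptions chosen for illustration: for a quadratic with constant Hessian G, setting $dF/d\lambda = 0$ gives $\lambda = \mathbf{h}^T\mathbf{h} / \mathbf{h}^TG\mathbf{h}$, so no numerical line search is needed here.

```python
def grad(x):
    """Gradient of the illustrative quadratic f(x1, x2) = x1**2 - x1*x2 + 3*x2**2."""
    x1, x2 = x
    return (2.0 * x1 - x2, -x1 + 6.0 * x2)


G = ((2.0, -1.0), (-1.0, 6.0))  # constant Hessian of this quadratic


def exact_step(h):
    """Step length minimising F(lam) = f(a - lam*h), where h is the gradient at a."""
    Gh = (G[0][0] * h[0] + G[0][1] * h[1], G[1][0] * h[0] + G[1][1] * h[1])
    return (h[0] * h[0] + h[1] * h[1]) / (h[0] * Gh[0] + h[1] * Gh[1])


def steepest_descent(a, tol=1e-8, max_iter=200):
    """Repeat steps 1 to 5 until the gradient is smaller than tol."""
    for k in range(max_iter):
        h = grad(a)                                  # step 3
        if (h[0] ** 2 + h[1] ** 2) ** 0.5 < tol:
            return a, k
        lam = exact_step(h)                          # step 5
        a = (a[0] - lam * h[0], a[1] - lam * h[1])   # step 4 with the best lam
    return a, max_iter


minimum, iterations = steepest_descent((1.0, 1.0))
print(minimum, iterations)
```

Even on this well-behaved function the method needs many more than two iterations, which illustrates the “catch” mentioned above.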

From straightforward use of the chain rule, we have

$$\frac{dF}{d\lambda} = \sum_{i=1}^{n} \frac{\partial f}{\partial x_i}\frac{dx_i}{d\lambda}.$$

Setting this to zero determines the minimising value of $\lambda$, and hence
the next approximation through

$$\mathbf{a}_{k+1} = \mathbf{a}_k - \lambda\mathbf{h}_k$$

where on the right is the initial guess $\mathbf{a}_k$. One possible student assignment
is to send the students off to investigate such methods, possibly using a specific
example such as Rosenbrock’s function $10(x_2 - x_1^2)^2 + (1 - x_1)^2$, starting at
the origin. Another is to investigate other optimisation methods such as the
use of line searches and using conjugate gradients. Perhaps the most successful
method of optimisation combines the simplicity of the steepest descent method
with the rigour of the Newton-Raphson method. One method that does this
is the Davidon-Fletcher-Powell method. Students need to realise that one is
approximating already in the Newton-Raphson method through truncating the
Taylor series, so further approximations brought in by inexact ways of inverting
the Hessian matrix are acceptable as long as they are carefully done. Instead
of working with G we define

$$\mathbf{x} = \mathbf{x}^{(r)} + t\,\mathbf{p}^{(r)}$$

where t is a parameter. This is a line in n space. Define

$$H^{(r+1)} = H^{(r)} + C^{(r)}$$

where $C^{(r)}$ is a correction term. The DFP algorithm (rth step) is as follows:

1. The search direction is determined by $\mathbf{p}^{(r)} = -H^{(r)}\nabla f(\mathbf{x}^{(r)})$, where $H^{(r)}$ is
the approximation to the inverse Hessian matrix and

$$\mathbf{y}^{(r)} = \nabla f(\mathbf{x}^{(r+1)}) - \nabla f(\mathbf{x}^{(r)}).$$

2. The step-length is determined by minimising on the lines given above
using steepest descent, that is $t^{(r)}$ is the value of t that minimises the
function $f(\mathbf{x}^{(r)} + t\,\mathbf{p}^{(r)})$.

3. Now update $\mathbf{x}^{(r)}$ using

$$\mathbf{x}^{(r+1)} = \mathbf{x}^{(r)} + t^{(r)}\mathbf{p}^{(r)}.$$

4. Finally, the next approximation to the inverse of the Hessian is given by

$$H^{(r+1)} = H^{(r)} - \frac{1}{\mathbf{y}^{(r)T}H^{(r)}\mathbf{y}^{(r)}}\,H^{(r)}\mathbf{y}^{(r)}\mathbf{y}^{(r)T}H^{(r)} + \frac{1}{\mathbf{h}^{(r)T}\mathbf{y}^{(r)}}\,\mathbf{h}^{(r)}\mathbf{h}^{(r)T}$$

where $\mathbf{h}^{(r)} = \mathbf{x}^{(r+1)} - \mathbf{x}^{(r)}$.

This algorithm looks complicated, and it only comes alive when examples are
done. Here is the DFP algorithm used to try and find the minimum of the
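function

$$f(x_1, x_2) = x_1^2 - x_1x_2 + 3x_2^2.$$

The start vertex is (1, 1), that is $x_1 = 1$ and $x_2 = 1$. This gives $f(1, 1) = 3$ and
$\nabla f = (2x_1 - x_2, -x_1 + 6x_2)^T$. At the point (1, 1), $\nabla f = (1, 5)^T$. The first step
is thus

$$x_{11} = 1 + t, \qquad x_{12} = 1 + 5t.$$

The parameter t is determined by minimising $f(1 + t, 1 + 5t)$, that is

$$(1 + t)^2 - (1 + t)(1 + 5t) + 3(1 + 5t)^2 = 3 + 26t + 71t^2.$$

Using standard methods, the answer is $t = -26/142 \approx -0.183$. The new value of $\mathbf{x}$ is thus

$$\mathbf{x}_1 = \begin{pmatrix} 0.817 \\ 0.085 \end{pmatrix} \quad \text{with} \quad \nabla f_1 = \begin{pmatrix} 1.549 \\ -0.310 \end{pmatrix}.$$

The updated approximation to H is $H_1$ and is given by the rather involved
looking formula in the DFP algorithm. First we note that

$$\mathbf{h}_0 = \mathbf{x}_1 - \mathbf{x}_0 = \begin{pmatrix} -0.183 \\ -0.915 \end{pmatrix}$$

and

$$\mathbf{y}_0 = \nabla f_1 - \nabla f_0 = \begin{pmatrix} 0.549 \\ -5.310 \end{pmatrix}$$

and so, with $H_0$ the identity matrix,

$$H_1 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} - \frac{1}{28.496}\begin{pmatrix} 0.302 & -2.917 \\ -2.917 & 28.195 \end{pmatrix} + \frac{1}{4.761}\begin{pmatrix} 0.0335 & 0.168 \\ 0.168 & 0.838 \end{pmatrix} = \begin{pmatrix} 0.996 & 0.138 \\ 0.138 & 0.187 \end{pmatrix}$$

$$H_1\nabla f_1 = \begin{pmatrix} 0.996 & 0.138 \\ 0.138 & 0.187 \end{pmatrix}\begin{pmatrix} 1.549 \\ -0.310 \end{pmatrix} = \begin{pmatrix} 1.501 \\ 0.155 \end{pmatrix}.$$

The next step is to consider $f(0.817 - 1.501t, 0.085 - 0.155t)$ in order to find a
value of t for which f is a minimum. This value is t = 0.544. To calculate the
next value of $\mathbf{x}$ we use the formula

$$\mathbf{x}_2 = \mathbf{x}_1 - 0.544\begin{pmatrix} 1.501 \\ 0.155 \end{pmatrix}$$

which equals $\mathbf{0}$ to within rounding. The DFP algorithm has thus found
the minimum in just two steps. This has to be the case for quadratic functions
of two variables when the line searches are exact, and students should be asked why.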
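The worked example can be checked with a short script. This is a minimal sketch rather than a general-purpose optimiser: it assumes the quadratic $f(x_1, x_2) = x_1^2 - x_1x_2 + 3x_2^2$, starts from $H_0 = I$, and uses the known Hessian G only to carry out each line search exactly. The names `dfp`, `matvec` and `dot` are illustrative. The step length is reported with the sign convention $\mathbf{x} + t\mathbf{p}$, $\mathbf{p} = -H\nabla f$, so the first step appears as a positive t.

```python
def grad(x):
    """Gradient of f(x1, x2) = x1**2 - x1*x2 + 3*x2**2."""
    return [2.0 * x[0] - x[1], -x[0] + 6.0 * x[1]]


G = [[2.0, -1.0], [-1.0, 6.0]]  # constant Hessian, used only for the exact line search


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def matvec(A, v):
    return [dot(row, v) for row in A]


def dfp(x, steps):
    n = len(x)
    H = [[float(i == j) for j in range(n)] for i in range(n)]  # H_0 = identity
    for r in range(steps):
        g = grad(x)
        p = [-c for c in matvec(H, g)]              # search direction p = -H grad f
        t = -dot(g, p) / dot(p, matvec(G, p))       # exact minimiser of f(x + t p)
        x_new = [xi + t * pi for xi, pi in zip(x, p)]
        h = [a - b for a, b in zip(x_new, x)]       # step taken
        y = [a - b for a, b in zip(grad(x_new), g)]  # change in gradient
        Hy = matvec(H, y)                           # H is symmetric, so (H y)(H y)^T = H y y^T H
        yHy, hy = dot(y, Hy), dot(h, y)
        H = [[H[i][j] - Hy[i] * Hy[j] / yHy + h[i] * h[j] / hy
              for j in range(n)] for i in range(n)]
        x = x_new
        print(f"step {r + 1}: t = {t:.3f}, x = ({x[0]:.3f}, {x[1]:.3f})")
    return x, H


x_min, H_final = dfp([1.0, 1.0], steps=2)
```

After two steps the iterate is the origin and `H_final` agrees with the exact inverse Hessian, which is a useful hint for the question of why two steps suffice.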

Further exercises for students could be to use software to solve more compli-
cated problems. They could try using the DFP algorithm on “show” functions
such as

$$(x_1 + 10x_2)^2 + 5(x_3 - x_4)^2 + (x_2 - 2x_3)^4 + 10(x_1 - x_4)^4$$

with start value (3, -1, 0, 1), or

$$100(x_2 - x_1^2)^2 + (1 - x_1)^2 + 90(x_4 - x_3^2)^2 + (1 - x_3)^2 + 10.1\left((x_2 - 1)^2 + (x_4 - 1)^2\right) + 19.8(x_2 - 1)(x_4 - 1)$$

with start value (-3, -1, -3, -1).

On both of these, using the Newton-Raphson method does not work and 
steepest descent is extremely slow. 

The field of non-linear optimisation is only thirty or so years old and has 
burgeoned on the back of the exponential increase in computer power. It is a 
very fertile area for both theoretical and practical case studies, group projects 
and individual projects. 

By contrast, this last case study reaches back into the History of Mathe- 
matics and should be accessible to all third year students without the use of a 
computer. 



5.7 Euler and Series 

The only prerequisites for this case study are knowledge of series, some 
trigonometry and elementary complex numbers, all normally found in math- 
ematics curricula in the last years of school. They would certainly have been 
covered by the end of the first year at university. 

Leonhard Euler had one of the finest mathematical minds of any era. This 
is not a controversial statement. The man left his mark on virtually all areas 
of mathematics, both pure and applied. If combinatorics is labelled as a rather 
pure area of statistics, then he can be said to have contributed to statistics
too. He was born in 1707 in Switzerland, near Basel and spent the bulk of his
professional life as Professor of Mathematics in St. Petersburg in Russia with a
25 year interlude (1741-1766) in Berlin. He died suddenly on 18th September
1783 aged 76, but not before writing more papers on mathematics than anyone 
else before or since. For more on this remarkably productive man see books on 
the History of Mathematics; I recommend especially Euler, Master of us All 
by William Dunham, published by the Mathematical Association of America 
in 1999 (185pp) which mingles biography with mathematical detail in a highly 
successful and entertaining way. 

This case study focuses mainly on Euler’s contribution to summing certain 
infinite series. It also touches on his pioneering work on complex numbers. In- 
finite series were a particular favourite with Euler, and he achieved much with 
them. Notice my reluctance to use the word ‘proof’. Euler possessed an incred- 
ible intuition for arriving at correct results and he often used mathematically 
illegal shortcuts to get there. In the eighteenth century, the principles of math- 
ematical rigour had not been firmly established. Advances in analysis occurred 
through Lagrange right at the end of the eighteenth century and in the next 
century through the work of Cauchy, Riemann, Gauss and others. This hav- 
ing been said, when Euler achieved a result he was aware of imperfections in 
his proof and often improved on his initial mathematics and re-derived results 
in many different ways. He had the uncanny knack of almost guessing correct 
results before establishing them with more and more rigour. He was the very 
essence of intuition and tenacity. For example, using the formula 

sin(A + B) = sin A cos B + sin B cos A 

where A and/or B are complex numbers is completely unjustified as this for-
mula is established through geometry in which both A and B are angles. How-
ever, using it Euler established de Moivre’s theorem

$$(\cos\theta + i\sin\theta)^n = \cos n\theta + i\sin n\theta.$$

He went on to derive

$$e^{i\theta} = \cos\theta + i\sin\theta$$

from which comes the remarkable

$$e^{i\pi} + 1 = 0$$

an equation that links the five most important numbers in mathematics. All 
of this derived from what now would be considered unjustified mathematics. 
Through this kind of historical material, students are made aware of the changes 
in mathematical standards of rigour as well as developments in mathematical 
notation. On this latter point, all mathematics students should see how alge-
braic equations (linear, quadratic and cubic) were described in the fourteenth
and fifteenth century before the onset of modern notation. One then begins
to understand how much harder mathematics can be with poor notation, and
how much easier it might be with good notation. To return to Euler, one result
worthy of study is his way (or ways) of finding the sum

$$\sum_{n=1}^{\infty} \frac{1}{n^2}.$$

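Let us run through his first method. First of all the “infinite polynomial” (as
Euler calls it) is defined by

$$P(x) = 1 - \frac{x^2}{3!} + \frac{x^4}{5!} - \frac{x^6}{7!} + \ldots$$

which sums to

$$P(x) = \frac{\sin x}{x}.$$

This result was reasonably well known in Euler’s time. Now, since $\sin x$ is zero
wherever $x = n\pi$, $n = 0, \pm 1, \pm 2, \ldots$, Euler then reasons that

$$\sin x = x\left(1 - \frac{x}{\pi}\right)\left(1 - \frac{x}{2\pi}\right)\ldots\left(1 + \frac{x}{\pi}\right)\left(1 + \frac{x}{2\pi}\right)\ldots$$

Thus

$$P(x) = \left(1 - \frac{x^2}{\pi^2}\right)\left(1 - \frac{x^2}{4\pi^2}\right)\left(1 - \frac{x^2}{9\pi^2}\right)\ldots$$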

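Questions of the convergence of the right hand side are left unanswered at this
stage, but could be a discussion point with the class. Euler now expands both
sides in powers of x, rearranging to give:

$$1 - \frac{x^2}{3!} + \frac{x^4}{5!} - \frac{x^6}{7!} + \frac{x^8}{9!} - \ldots = 1 - \left(\frac{1}{\pi^2} + \frac{1}{4\pi^2} + \frac{1}{9\pi^2} + \frac{1}{16\pi^2} + \frac{1}{25\pi^2} + \ldots\right)x^2 + \ldots$$

Is the rearrangement of the infinite series on the right legal? Equating coeffi-
cients of $x^2$ yields

$$-\frac{1}{3!} = -\frac{1}{\pi^2}\left(1 + \frac{1}{4} + \frac{1}{9} + \frac{1}{16} + \frac{1}{25} + \ldots\right)$$

from which

$$1 + \frac{1}{4} + \frac{1}{9} + \frac{1}{16} + \frac{1}{25} + \ldots = \frac{\pi^2}{6}.$$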

Euler calculated the left hand side numerically. Full frontal attacks are not
useful as the series on the left converges only slowly. For example, summing the
first 1000 terms gives an answer accurate to only two decimal places. Students
should be asked to verify this, these days using a spreadsheet perhaps or a
favourite piece of mathematical software. Euler had no such luxuries and used
the following ingenious calculus trick. The improper integral

$$I = -\int_0^{1/2} \frac{\ln(1 - t)}{t}\,dt$$

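is evaluated in two different ways. Expanding the logarithm in the numerator
and integrating term by term yields

$$\begin{aligned}
I &= \int_0^{1/2} \frac{1}{t}\left(t + \frac{t^2}{2} + \frac{t^3}{3} + \frac{t^4}{4} + \frac{t^5}{5} + \ldots\right)dt \\
  &= \int_0^{1/2} \left(1 + \frac{t}{2} + \frac{t^2}{3} + \frac{t^3}{4} + \frac{t^4}{5} + \ldots\right)dt \\
  &= \left[t + \frac{t^2}{4} + \frac{t^3}{9} + \frac{t^4}{16} + \frac{t^5}{25} + \ldots\right]_0^{1/2} \\
  &= \frac{1}{2} + \frac{(1/2)^2}{4} + \frac{(1/2)^3}{9} + \frac{(1/2)^4}{16} + \frac{(1/2)^5}{25} + \frac{(1/2)^6}{36} + \ldots
\end{aligned}$$

Alternatively, substituting $x = 1 - t$ gives

$$\begin{aligned}
I &= -\int_0^{1/2} \frac{\ln(1 - t)}{t}\,dt = -\int_{1/2}^{1} \frac{\ln x}{1 - x}\,dx \\
  &= -\int_{1/2}^{1} (1 + x + x^2 + x^3 + \ldots)\ln x\,dx \\
  &= -\int_{1/2}^{1} \ln x\,dx - \int_{1/2}^{1} x\ln x\,dx - \int_{1/2}^{1} x^2\ln x\,dx - \int_{1/2}^{1} x^3\ln x\,dx - \ldots
\end{aligned}$$

The general result

$$\int_{1/2}^{1} x^n\ln x\,dx = \left[\frac{x^{n+1}}{n+1}\ln x - \frac{x^{n+1}}{(n+1)^2}\right]_{1/2}^{1}$$

can be used to evaluate the right hand side for $n = 0, 1, 2, \ldots$ to give

$$I = -\left[(x\ln x - x) + \left(\frac{x^2}{2}\ln x - \frac{x^2}{4}\right) + \left(\frac{x^3}{3}\ln x - \frac{x^3}{9}\right) + \ldots\right]_{1/2}^{1}.$$

At $x = 1$ the bracket reduces to $-\sum 1/n^2$, and at $x = 1/2$ it equals
$-(\ln 2)^2 - I$ by the first evaluation.
Thus, after convincing himself that [ln(0)ln(1)] is zero(!) (an exercise in
L’Hôpital’s Rule for today’s students) Euler arrived at

$$\sum_{n=1}^{\infty} \frac{1}{n^2} = (\ln 2)^2 + 2\sum_{n=1}^{\infty} \frac{1}{n^2\,2^n},$$

a sum which converges much more rapidly due to the power of two in the
denominator of the right hand side sum. Using only fourteen terms, Euler
obtained the answer

$$\sum_{n=1}^{\infty} \frac{1}{n^2} \approx 1.644934$$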

an answer we now know to be accurate to six decimal places. Evaluating $\pi^2/6$
verifies this nicely. Euler had got the right answer, but could all the mathemat-
ical steps be verified? Students could be asked to establish Wallis’ formula

$$\frac{2}{\pi} = \frac{1\cdot 3\cdot 3\cdot 5\cdot 5\cdot 7\cdot 7\cdot 9\cdots}{2\cdot 2\cdot 4\cdot 4\cdot 6\cdot 6\cdot 8\cdot 8\cdots}$$

by putting $x = \pi/2$ in the $P(x)$ defined above. Another extension is to find

$$\sum_{n=1}^{\infty} \frac{1}{n^4}$$

and perhaps

$$\sum_{n=1}^{\infty} \frac{1}{n^6}$$

both of which Euler found, but which today can be obtained more easily. In-
vestigations on the more general problem, finding

$$\sum_{n=1}^{\infty} \frac{1}{n^k} \quad \text{where } k = \text{an integer},$$

leads students towards the realms of the Riemann zeta function and outside
undergraduate mathematics.
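The numerical claims in this case study are easy for students to check. The sketch below compares the direct sum with the rapidly converging form, and also evaluates a partial Wallis product. It assumes the transformed series is taken in the standard form $(\ln 2)^2 + 2\sum 1/(n^2 2^n)$; the function names are illustrative.

```python
import math


def direct_sum(terms):
    """Partial sum 1 + 1/4 + 1/9 + ... with the given number of terms."""
    return sum(1.0 / (n * n) for n in range(1, terms + 1))


def accelerated_sum(terms):
    """(ln 2)^2 + 2 * sum of 1/(n^2 2^n), truncated after the given number of terms."""
    tail = sum(1.0 / (n * n * 2 ** n) for n in range(1, terms + 1))
    return math.log(2.0) ** 2 + 2.0 * tail


def wallis_product(pairs):
    """Partial product (1*3)/(2*2) * (3*5)/(4*4) * ..., which tends to 2/pi."""
    product = 1.0
    for n in range(1, pairs + 1):
        product *= (2 * n - 1) * (2 * n + 1) / (2 * n) ** 2
    return product


exact = math.pi ** 2 / 6
print("direct, 1000 terms    :", direct_sum(1000), "error", exact - direct_sum(1000))
print("accelerated, 14 terms :", accelerated_sum(14), "error", exact - accelerated_sum(14))
print("Wallis, 1000 factors  :", wallis_product(1000), "target", 2 / math.pi)
```

The error of the direct sum after N terms is close to 1/N, which is why a frontal attack is so unrewarding.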

Another fruitful extension, or more accurately parallel investigation, is the
origin of the Euler constant $\gamma \approx 0.577215\ldots$, a number not yet known
even to be irrational, defined by

$$\gamma = \lim_{n\to\infty}\left[\sum_{r=1}^{n} \frac{1}{r} - \ln(n + 1)\right].$$

The results:



and 



7 = 




particularly the last, have a particularly pleasant symmetry and provide chal- 
lenges for students. 



5.8 Summary 

In this chapter several case studies have been presented. The material could 
form a standard taught module: on the other hand each could be the subject of 
a group project or individual project. The distinctive feature of case studies is
that there is a lot of presented material, which means that the lecturer has
more control over the direction the case study takes. In turn this usually means
that the mathematics within the case study can afford to be more technical,
which opens up a wider class of problems for the student body. For example, it
would be a very good student indeed who could get as far as the surface ocean 
dynamics case study outlined in section 5.2 through an individual project. Case 
studies thus give an opportunity to explore a specific piece of mathematics in a 
way such that the scope is very well defined by the lecturer. However, precisely 
because the lecturer is closely defining the scope, the mathematics can be more 
technical and the topic can be pushed further than is normal in an individual 
project. 

Case studies can be assessed in a number of ways. Perhaps the most straight- 
forward is to set coursework, but this has the usual danger that the submitted 
work is a group effort, or more unacceptably the work of one bright student 
who has allowed others to copy. An alternative is to set questions to be done 
under examination conditions. Longer than normal questions can be set, and 
the students could be allowed to take material into the examination room (open 
book examination). The questions would test the knowledge each student has 
of the case study while inviting him or her through the questions to take it a 
little further. Here are some examples of examination questions. Try some of
them.



5.9 Exercises



1. By considering the equations for geostrophic flow and ignoring all vertical 
motion, show that the flow around a low pressure centre is anticlockwise 
in the northern hemisphere and clockwise in the southern hemisphere. If 
friction is included in the equations, deduce how this modifies the flow. 



2. Solve the time dependent Ekman equations where they take the form

$$\frac{\partial W}{\partial t} + ifW = \nu\frac{\partial^2 W}{\partial z^2}$$

using Laplace Transform techniques. Use the following boundary condi-
tions:

$$\frac{\partial W}{\partial z}(0, t) = \frac{1}{\rho\nu}\tau, \qquad W \to 0 \text{ as } z \to -\infty$$

and the sea is at rest initially. The complex form of the equations has been
used whereby the horizontal flow (u, v) and the wind stress $(\tau_x, \tau_y)$ are
expressed as $W = u + iv$ and $\tau = \tau_x + i\tau_y$ with $i = \sqrt{-1}$. Do not attempt
to invert the Laplace Transform, but use the final value theorem

$$\lim_{t\to\infty} W(z, t) = \lim_{s\to 0} s\bar{W}(s)$$

where s is the Laplace Transform variable to deduce both the functional
form of the steady Ekman spiral, and the spin up time of this spiral.



3. Consider the damped Duffing equation

$$\ddot{x} + k\dot{x} + x - \frac{1}{6}x^3 = \Gamma\cos\omega t.$$

Follow the procedure in the notes by substituting $x = a(t)\cos\omega t + b(t)\sin\omega t$,
assuming both $a(t)$ and $b(t)$ are slowly varying. Neglect $k\dot{a}$ and
$k\dot{b}$, and hence show that to this degree of approximation



a 

b 






Find the linear approximation in the neighbourhood of the equilibrium
point when > 1. (Adapted from Jordan and Smith (1993).)

4. A pendulum with a light, rigid suspension is placed upside down on end,
and the point of suspension is caused to oscillate vertically with displace-
ment y given by $y = \epsilon\cos\omega t$, $\epsilon \ll 1$. Show that the equation of motion
is

$$\ddot{\theta} + \left(-\frac{g}{a} + \frac{\epsilon\omega^2}{a}\cos\omega t\right)\sin\theta = 0$$

where a is the length of the suspension, g is gravitational acceleration and
$\theta$ the inclination to the vertical. Linearise this equation for small amplitude
oscillations and show that Mathieu’s equation is obtained. Investigate the
equilibrium of this oscillation. (You may assume that Mathieu’s equation

$$\ddot{x} + (\alpha + \beta\cos t)x = 0$$

has stability regions as given in Figure 5.6.)

Figure 5.6 The stability diagram for Mathieu’s equation. The shaded area
is an unstable region in which at least one solution is unbounded, elsewhere
all solutions are bounded. (Dashed line, solution period $2\pi$ exists; continu-
ous line, solution period $4\pi$ exists). From Nonlinear Differential Equations by
D.W. Jordan and P. Smith (Second edition, 1987). Reprinted by permission of
Oxford University Press.



5. The traffic along a single lane is modelled using the equation

$$\frac{\partial\rho}{\partial t} + \frac{\partial}{\partial x}\left(\rho v(\rho)\right) = 0$$

where $\rho(x, t)$ is the density of cars and $v(\rho)$ their velocity. If

v[p) = -Co 

Pmax 

and

$$\rho(x, 0) = \rho_1 e^{-k|x|}$$

where k and $\rho_1 (< \rho_{\max})$ are positive constants, determine when and where
the solution first becomes undefined and hence a shock forms. Rework for
the density given by

( 0 for X < 0 

p{x, 0) = < for 0 < X < L 

[ Pi for X > L 



where L is a constant. (Adapted from Billingham and King (2000).) 

6. The flow of cars along a single lane road can be described using a continuous
car density $\rho(x, t)$ with a car velocity given by

$$v(\rho) = \frac{v_0}{\rho_{\max}^2}(\rho_{\max} - \rho)^2$$

for $0 < \rho < \rho_{\max}$. Determine the equation satisfied by $\rho$, and calculate the
kinematic wave speed, $c(\rho)$. Show that $c(\rho)$ vanishes when $\rho = \frac{1}{3}\rho_{\max}$ and
has a minimum at $\rho = \frac{2}{3}\rho_{\max}$.

At time t = 0 the car density is

$$\rho(x, 0) = \frac{\rho_L + \rho_R e^{x/L}}{1 + e^{x/L}},$$

with $0 < \rho_R < \frac{1}{3}\rho_{\max}$ and $\frac{2}{3}\rho_{\max} < \rho_L < \rho_{\max}$. Sketch the function
$c(\rho(x, 0))$. Also sketch the development of the car density for t > 0.
(Adapted from Billingham and King (2000).)



7. Find a solution to the ordinary differential equation

$$w'' - 2zw' + 2kw = 0$$

of the form

$$\int_C e^{z\zeta} f(\zeta)\,d\zeta$$

describing two possible contours C.

8. Prove that the differential equation

$$zw'' + 2aw' - zw = 0$$

where a is a real constant may be satisfied by assuming that

$$w = \int_C (\zeta^2 - 1)^{a-1} e^{z\zeta}\,d\zeta$$

where C is a suitable contour. Determine two allowable contours.

9. Determine the complete solution to the third order differential equation

$$zw''' + w = 0$$

by using contour integrals.

10. Investigate finding the minimum of the function

$$f(x_1, x_2, x_3) = 2(x_1 - 0.2)^4 + (x_2 - 0.1)^4 + 0.5(x_3 - 0.3)^4$$

using the method of steepest descent. Check your progress at each stage
by comparing the approximation with the obvious correct solution.

11. Use the Davidon-Fletcher-Powell method to determine the minimum of 
the function 

f{x,y,z) = {x-yf + —{x + y+lf 

using two iterations. Show that the method of steepest descent is exact after
only one iteration, but that the Newton-Raphson method converges only
slowly. Give possible explanations.

12. Find at least two mathematically illegal steps in Euler’s “proof” from con-
sidering the sine function that

$$\sum_{n=1}^{\infty} \frac{1}{n^2} = \frac{\pi^2}{6}.$$

Given that the result is correct, how are the steps Euler took in his proof
validated?

13. “Euler’s contribution to efficient mathematical notation is second to none.” 
Discuss. 




A 

Project Example 1: Topics in Galois 

Theory 



Preamble 

For this project, the student explored an area of mathematics that was outside 
the course, and was difficult. The information was available and well embedded 
in the literature: however the concepts were advanced. In style therefore, this 
project most closely resembles the Hypergeometric Functions project outlined in 
Section 3.7. 



Abstract 

Galois Theory was originally formulated to determine whether the roots of a 
rational polynomial could be expressed in radicals. The theory is based on the 
association of a group to each polynomial and the analysis of the group to 
determine solubility in terms of radicals. We first study Galois Theory with 
the approach of its author, Evariste Galois, then in its modern form based on
abstract algebra, particularly field theory, which allows a more general interpre- 
tation. We then outline the theory of soluble groups and give some examples. 
Finally, we consider the application of methods in Galois Theory to the three 
classical construction problems and the construction of n-gons.



Introduction

Evariste Galois (1811-1832) was a French mathematician who was born near 
Paris. Although he died in tragic circumstances at 20 years old he had already 
produced work that would make him famous. Unfortunately during his lifetime 
Galois suffered many setbacks and was considered to be a troublemaker by the 
government. He also found himself unaccepted by the mathematical establish-
ment. It was not until 1846 that Joseph Liouville published some of Galois’
work in his Journal de Mathématiques [ED, p1]. Since then the importance of
Galois’ results has been realised and his work has provided the impetus for 
many developments in algebra. 

In his Memoir on the Conditions for Solvability of Equations by Radicals
Galois was solving a problem that interested many mathematicians including
Lagrange. The solution to the general quadratic equation has been known since
Babylonian times. In the 16th century formulae were discovered for the cubic
and quartic. The solution for the cubic equation $x^3 + px = q$ found by Niccolo
Fontana is

$$x = \sqrt[3]{\frac{q}{2} + \sqrt{\frac{p^3}{27} + \frac{q^2}{4}}} + \sqrt[3]{\frac{q}{2} - \sqrt{\frac{p^3}{27} + \frac{q^2}{4}}}.$$

This is an expression built up from the coefficients of the equation using the
usual arithmetical operators with the nth roots, or radicals [ST, p xiv]. If the
roots of an equation can be expressed in this way it is said to be “soluble by
radicals”. Mathematicians hoped that they would be able to find formulas for
general equations of degree $n \geq 5$. Lagrange worked for many years on the
solution of general equations, finding improved solutions for both the cubic
and quartic [ED, pp18-22]. He was unable to find a solution for the quintic
and expressed doubts that a solution existed. His work was almost certainly a
source of inspiration for Galois. In 1824 Abel proved that the general quintic
was not soluble by radicals so that a general formula did not exist, but was
unable to find a method for identifying which specific equations are soluble.
This question was answered by Galois comprehensively in a piece of brilliant
and original mathematics known as Galois Theory. This is the subject of this
project and we now give an outline of each section.

In Section A.1 we closely follow the English translation of Galois’ memoir
by Harold M. Edwards in his book Galois Theory [ED]. The style, both of the 
translation and accompanying text, is traditional and often difficult to follow 
so much of the material in this section is a modified interpretation. The book 
takes a concrete rather than abstract approach to Galois Theory, and Edwards 
has attempted to present the theory with as little modern algebra as possible, 
for the most part drawing only on the techniques available to Galois. Some
basic field theory is used in the text, particularly in the proofs, but we delay
the introduction of fields until Section A.2.

In Section A.2 we study Galois Theory in its modern form which makes use
of the developments in abstract algebra over the last 150 years. Field theory 
has allowed mathematicians to interpret Galois Theory in a formal, abstract 
sense, suited to the more rigorous standards of definition and proof introduced 
at the turn of the century. The main source for this chapter is Ian Stewart’s 
Galois Theory [ST] which has a more accessible style than Edwards’ book. In 
separating the two approaches to Galois Theory we are able to appreciate the 
simplicity of Galois’ approach which the more structured methods of abstract 
algebra do not share although they allow Galois Theory to be applied to more 
general areas than rational polynomials. 

Groups play an essential part in Galois Theory and Galois’ work in this 
area provided a foundation for the study of group theory. This is expanded 
on in Section A.3, where we give an outline of the relevant theory and some
examples. Sources for this chapter include Stewart’s book and Allenby’s Rings, 
Fields and Groups [AL] which was useful for a more detailed approach and 
worked examples. 

Finally, Section A.4 explores the applications of field theory to classical
geometry, using the methods developed in our study of Galois Theory. Here 
we combine elements from Stewart, Allenby and also John R. Durbin’s Modern 
Algebra: An Introduction. 

This project is intended to be an introduction to Galois Theory although, 
due to the depth of the subject and restrictions upon the project, it has not 
been possible to give more than an outline of many of the proofs. It is hoped 
however that it will provide the reader with the impetus for a more detailed 
study of the subject. 



A.1 Galois’ Approach

A.1.1 Preparation

We begin this section by reviewing some facts from basic algebra. 

A.1.1.1 Polynomials. We consider polynomials with rational coefficients un-
less specifically declared otherwise. A polynomial of degree n is irreducible in Q
if it cannot be expressed as a product of two or more polynomials whose degrees
are less than n but greater than 0. A polynomial of degree 0 is a constant
polynomial. The factorisation of a polynomial into irreducible polynomials is
unique up to constant factors and the order in which the factors are written.
If $f(x) = a_nx^n + \ldots + a_1x + a_0$ and $a_n = 1$ then f is a monic polynomial.

Since we are interested in the roots of a polynomial, when two polynomials
differ only by a constant factor we consider them to be essentially the same.
This is demonstrated by the example

$$f(x) = 2x^2 - 2x - 12 = 2(x + 2)(x - 3)$$

$$g(x) = x^2 - x - 6 = (x + 2)(x - 3).$$

So $f \neq g$ but f and g have the same roots.

A.1.1.2 Some Methods of Checking Irreducibility. If f is a polynomial that
is irreducible in Z then f is also irreducible in Q. For example

let $g(x) = \frac{1}{3}x^2 + \frac{1}{3}x + 6$

and $f(x) = x^2 + x + 18 = 3g(x)$.

Now f is irreducible in Z so f is irreducible in Q.

Eisenstein’s Irreducibility Criterion [ST, p20]

Let

$$f(x) = a_nx^n + \ldots + a_1x + a_0, \quad a_i \in \mathbf{Z}, \quad i = 0, 1, \ldots, n.$$

Suppose there is a prime q such that $q \nmid a_n$, $q \mid a_i$ for $i = 0, 1, 2, \ldots, n - 1$, and
$q^2 \nmid a_0$; then f is irreducible in Q.

For example, if $f(x) = x^5 + 4x^4 + 6x^3 + 2x^2 + 8x + 2$ and we take q = 2
then f is irreducible in Q.

Unfortunately not every polynomial can be treated with this method. How-
ever, we can sometimes rewrite a polynomial so that the method will be suit-
able. For example, $f(x) = g(x)h(x)$ if and only if $f(x + 1) = g(x + 1)h(x + 1)$.
Hence $f(x)$ is irreducible if and only if $f(x + 1)$ is irreducible [ST, p21].

An example of this is $f(x) = x^4 + x^3 + x^2 + x + 1$ where Eisenstein’s criterion
does not apply. But $f(x + 1) = x^4 + 5x^3 + 10x^2 + 10x + 5$ and is irreducible by
Eisenstein’s criterion with q = 5. Hence $f(x)$ is irreducible in Q.
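Both examples above can be checked mechanically. The following is a minimal sketch: polynomials are stored as integer coefficient lists with the highest power first, the helper names `eisenstein` and `shift_by_one` are illustrative, and the code assumes that the caller supplies a q that really is prime.

```python
from math import comb


def eisenstein(coeffs, q):
    """True if the prime q satisfies Eisenstein's criterion for [a_n, ..., a_1, a_0]."""
    leading, lower, constant = coeffs[0], coeffs[1:], coeffs[-1]
    return (leading % q != 0
            and all(a % q == 0 for a in lower)
            and constant % (q * q) != 0)


def shift_by_one(coeffs):
    """Coefficients of f(x + 1), highest power first."""
    n = len(coeffs) - 1
    shifted = [0] * (n + 1)
    for i, a in enumerate(coeffs):
        degree = n - i
        for j in range(degree + 1):
            # (x + 1)**degree contributes comb(degree, j) to the x**j term
            shifted[n - j] += a * comb(degree, j)
    return shifted


quintic = [1, 4, 6, 2, 8, 2]      # x^5 + 4x^4 + 6x^3 + 2x^2 + 8x + 2
cyclotomic = [1, 1, 1, 1, 1]      # x^4 + x^3 + x^2 + x + 1

print(eisenstein(quintic, 2))
print(eisenstein(cyclotomic, 5))
print(shift_by_one(cyclotomic))
print(eisenstein(shift_by_one(cyclotomic), 5))
```

A result of `False` only means that the criterion does not apply for that q; it says nothing about reducibility.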

A. 1.1. 3 Roots of Polynomials. Each root of a polynomial is either simple or 
multiple (repeated). We will want to work with polynomials that have only 
simple roots, but it is quite straightforward to detect simple roots. 

If fix) has roots ai,a 2 , ■ . ■ ,an which are roots of multiplicity mi, m 2 , . ■ • , m„ 
respectively then 



fix) = (x - (x - . . . (x - 
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If we differentiate formally then 

Df = {x- air^-\x - a 2 r^-^ ...{x- ^ m, n(* " «i) 

j i¥=j 

and it is clear that if, for any i, rrii > I then / and Df have a common factor. 
Thus we can say 
Lemma 1 [ST,p85] 

A polynomial $f$ has a multiple root if and only if $f$ and $Df$ have a common factor of degree $\geq 1$. If $f$ has no multiple roots then $f$ and $Df$ will be coprime, i.e. the highest common factor will be a polynomial of degree 0.

Any irreducible polynomial $f$ of degree $n$ with rational coefficients must have only simple roots: if it had a multiple root then $f$ and $Df$ would have a common factor of degree $\geq 1$ which, since $f$ is irreducible, can only be a multiple of $f$. But $Df$ has degree less than $n$, so we must have $Df = 0$, which implies that $f$ is a constant, a contradiction.
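Lemma 1 translates directly into a computation: run the Euclidean algorithm on $f$ and $Df$ and look at the degree of the result. The sketch below is an illustration under our own conventions (coefficient lists over the rationals, lowest degree first; the function names are hypothetical), not a routine taken from [ST].

```python
from fractions import Fraction


def trim(p):
    """Drop trailing zero coefficients (the zero polynomial becomes [])."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p


def derivative(p):
    """Formal derivative Df of a coefficient list (lowest degree first)."""
    return [k * c for k, c in enumerate(p)][1:]


def poly_mod(a, b):
    """Remainder of a divided by the non-zero polynomial b, over Q."""
    a = trim([Fraction(c) for c in a])
    b = trim([Fraction(c) for c in b])
    while len(a) >= len(b):
        factor, shift = a[-1] / b[-1], len(a) - len(b)
        for i, c in enumerate(b):
            a[i + shift] -= factor * c   # cancel the leading term of a
        a = trim(a)
    return a


def poly_gcd(a, b):
    """Euclidean algorithm; the result is determined up to a constant factor."""
    a, b = trim(a), trim(b)
    while b:
        a, b = b, poly_mod(a, b)
    return a


def has_multiple_root(f):
    """True when f and Df share a factor of degree >= 1."""
    return len(poly_gcd(f, derivative(f))) > 1


print(has_multiple_root([-2, 0, 0, 1]))   # x^3 - 2
print(has_multiple_root([2, -3, 0, 1]))   # x^3 - 3x + 2 = (x - 1)^2 (x + 2)
```

Exact rational arithmetic is used so that a zero remainder is recognised reliably; floating point would blur the distinction between a constant and a linear common factor.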

A.1.1.4 Symmetric Polynomials. Symmetric polynomials play an essential role in Galois Theory as they allow us to express rational functions of the roots of a given equation in terms of the coefficients of the equation. Consider a general quadratic equation $f(x) = x^2 + bx + c$. If the roots of $f$ are $\alpha$ and $\beta$ then

$$f(x) = (x - \alpha)(x - \beta) = x^2 - (\alpha + \beta)x + \alpha\beta.$$

We can immediately see that $\alpha + \beta = -b$, $\alpha\beta = c$. The functions $\alpha + \beta$ and $\alpha\beta$ are called symmetric polynomials since their values do not vary if $\alpha$ and $\beta$ are interchanged.

A symmetric polynomial is any polynomial in $n$ indeterminates that is invariant under any permutation of the indeterminates. For example, if the roots of a polynomial are $\alpha_1, \alpha_2, \alpha_3$ then the following functions are symmetric polynomials in the roots:

$$\alpha_1^2 + \alpha_2^2 + \alpha_3^2, \quad \alpha_1 + \alpha_2 + \alpha_3, \quad \alpha_1\alpha_2 + \alpha_1\alpha_3 + \alpha_2\alpha_3, \quad \alpha_1\alpha_2\alpha_3.$$



The last three examples are special cases of symmetric polynomials called elementary symmetric polynomials. It is these functions that are directly related to the coefficients of the polynomial of which the indeterminates are the roots. Let $f(x) = x^n + b_1 x^{n-1} + b_2 x^{n-2} + \dots + b_n = (x - r_1)(x - r_2)(x - r_3) \dots (x - r_n)$. The elementary symmetric polynomials are

$$\sigma_1 = r_1 + r_2 + r_3 + \dots + r_n = -b_1$$
$$\sigma_2 = r_1 r_2 + r_1 r_3 + \dots + r_{n-1} r_n = b_2$$
$$\vdots$$
$$\sigma_n = r_1 r_2 r_3 \dots r_n = (-1)^n b_n$$

where $\sigma_j = 0$ for $j > n$. The general formula is $\sigma_k = (-1)^k b_k$.

Theorem 2 (Fundamental Theorem on Symmetric Polynomials) [ED, p9]

Every symmetric polynomial in $r_1, r_2, \dots, r_n$ can be expressed as a polynomial in the elementary symmetric polynomials $\sigma_1, \sigma_2, \dots, \sigma_n$. Moreover, a symmetric polynomial with integer coefficients can be expressed as a polynomial in $\sigma_1, \sigma_2, \dots, \sigma_n$ with integer coefficients. (Proof [ED, pp9-12].)

Let $s_k = r_1^k + r_2^k + r_3^k + \dots + r_n^k$ for $k = 1, 2, 3, \dots$. Then we have the general formula [ED, p12]

$$s_k - s_{k-1}\sigma_1 + s_{k-2}\sigma_2 - \dots + (-1)^{k-1} s_1 \sigma_{k-1} + (-1)^k k \sigma_k = 0, \qquad k = 1, 2, 3, \dots$$
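The general formula can be solved for $s_k$ and used as a recurrence, which gives the power sums of the roots without ever finding the roots. The sketch below illustrates this; the function name `power_sums` and the list layout are our own choices.

```python
def power_sums(sigma, count):
    """Return [s_1, ..., s_count] from sigma = [sigma_1, ..., sigma_n].

    Uses s_k = sum_{j=1}^{k-1} (-1)^(j-1) sigma_j s_{k-j} + (-1)^(k-1) k sigma_k,
    with sigma_j = 0 for j > n.
    """
    n = len(sigma)
    s = []
    for k in range(1, count + 1):
        total = 0
        for j in range(1, k):
            if j <= n:
                # s[k - j - 1] holds s_{k-j} because s[0] is s_1
                total += (-1) ** (j - 1) * sigma[j - 1] * s[k - j - 1]
        if k <= n:
            total += (-1) ** (k - 1) * k * sigma[k - 1]
        s.append(total)
    return s


# Roots 1, 2, 3: sigma = (6, 11, 6)
print(power_sums([6, 11, 6], 3))
# Roots of x^3 - 2: sigma = (0, 0, 2)
print(power_sums([0, 0, 2], 6))
```

For the roots 1, 2, 3 the recurrence returns 6, 14, 36, which agrees with a direct calculation of $1 + 2 + 3$, $1 + 4 + 9$ and $1 + 8 + 27$.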

We can also express the elementary symmetric polynomials in $n - 1$ indeterminates in terms of elementary symmetric polynomials in $n$ indeterminates. Let the indeterminates be $r_1, r_2, r_3, \dots, r_n$ and let the elementary symmetric polynomials in the $n$ indeterminates be $\sigma_k$. Let the elementary symmetric polynomials in the $n - 1$ indeterminates $r_1, r_2, r_3, \dots, r_{n-1}$ be $\tau_k$. Then [ED, pp10-11]

$$\tau_1 = \sigma_1 - r_n$$
$$\tau_2 = \sigma_2 - r_n \sigma_1 + r_n^2$$
$$\tau_3 = \sigma_3 - r_n \sigma_2 + r_n^2 \sigma_1 - r_n^3$$
$$\vdots$$
$$\tau_{n-1} = \sigma_{n-1} - r_n \sigma_{n-2} + \dots + (-1)^{n-1} r_n^{n-1}$$



Finally, we note Lemma 1 from Galois’ Memoir [ED, p51]: 

Lemma 3[ED, pp51-52]: 

An irreducible equation cannot have a root in common with a rational 
equation without dividing it. 

Proof : Let g and h be polynomials and g be irreducible and let g and h have 
a common root. Then using the Euclidean algorithm we can write d = Ag + Bh 
where d is the greatest common divisor of g and h, and A and B are rational 







polynomials. If we substitute into this equation the common root, r, then we have d(r) = 0. Therefore d has degree > 0. Since d divides g and g is irreducible, d must be a non-zero multiple of g. Since d divides h it follows that g divides h.



□ 



A.1.2 The Galois Resolvent

A.1.2.1 Forming a Galois Resolvent. With the preliminary information detailed in Section A.1.1 we first study Galois Theory through Galois’ original approach. The remainder of this chapter is drawn mostly from Edwards’ translation of Galois’ Memoir [ED, Appendix 1, pp101-113] and accompanying text. As a starting point we have the “given” equation. Let this be

$$f(x) = x^n + a_{n-1}x^{n-1} + \dots + a_1 x + a_0.$$

This equation should be irreducible and have $n$ distinct roots $\alpha_1, \alpha_2, \dots, \alpha_n$. In addition, the coefficients of $f$ are assumed to be rational.

Our aim is to show whether or not the roots of $f$ can be expressed in terms of radicals. The only information available is the coefficients of the given equation. However, symmetric polynomials allow us to extract information about the roots from these coefficients.

Following earlier work by Lagrange ([AL, pp181-186] and [ED, pp18-22, pp32-35]) Galois constructed a function of the roots of the given equation. This function had particular properties and is now known as a Galois resolvent. The conditions governing the function are stipulated by Galois’ Lemma II [ED, p102]:

Lemma 4: Given any equation with distinct roots $a, b, c, \dots$, one can always form a function $V$ of the roots such that no two of the values one obtains by permuting the roots in this function are equal. Further, the coefficients of this function can be chosen as integers.

Proof: 

Let our Galois resolvent be

$$V_0 = x_1\alpha_1 + x_2\alpha_2 + \dots + x_n\alpha_n, \qquad x_i \in \mathbb{Z}.$$

We define a permutation of the roots by $\phi(\alpha_i) = \alpha_{\phi(i)}$. Any permutation $\phi \in S_n$ of the roots will give a new function of the roots which we will call a conjugate of the resolvent, say $V_i$. We shall denote the numerical value of any conjugate $V_i$ of the resolvent by $t_i$. So if $\phi = (123 \dots n)$ then

$$\phi(V_0) = x_1\alpha_2 + x_2\alpha_3 + \dots + x_n\alpha_1 = V_1, \qquad V_0 \neq V_1.$$

Note that we insist the value of each conjugate of the resolvent is different, not just the formal representation. We wish to show that the $n!$ conjugates have distinct values so let

$$D = \prod_{S,T} \left[x_1(S(\alpha_1) - T(\alpha_1)) + x_2(S(\alpha_2) - T(\alpha_2)) + \dots\right]^2$$

where the product is over the $n!(n! - 1)/2$ unordered pairs of permutations $S$ and $T$ in which $S \neq T$. If we regard the $x_i$ as variables then $D$ is a polynomial in the $x_i$ with coefficients that are polynomials in the $\alpha_i$. $D$ is symmetric in the $\alpha_i$ so the coefficients are symmetric in the $\alpha_i$. This means that $D$ is a polynomial with known (rational) coefficients. Since $D$ is a product of non-zero polynomials it is non-zero. We can therefore choose the $x_i$ as integers so that $D \neq 0$.

□ 

A.1.2.2 Using a Galois Resolvent. Once we have a suitably chosen Galois resolvent we can use it in a series of calculations that yield information about the given equation $f$. We form the “equation for $V$” by constructing a polynomial with the resolvent and its conjugates as its roots. Let these be $V_0 = t_0, V_1 = t_1, \dots, V_{n!-1} = t_{n!-1}$. Then the polynomial will be

$$F(X) = (X - t_0)(X - t_1) \dots (X - t_{n!-1}).$$

Note that $F$ is invariant under any permutation of the roots of $f$ since this would only interchange the $t_i$. So the coefficients of $F$ must be symmetric polynomials in the roots of $f$ and are therefore known quantities. We can calculate these quantities as shown in the following example. This example will be used throughout Sections A.1 and A.2 to demonstrate various aspects of Galois Theory.

Let $u(x) = x^3 - 2$. This is an irreducible polynomial with simple roots. Let the roots of $u$ be $a, b, c$. We form the Galois resolvent, choosing coefficients that are integers. To make the calculations easier for this example, the coefficients have been chosen with the prior knowledge that they yield different values of the resolvent for each conjugation. Normally, such a calculation would need to be carried out without explicit values for the coefficients. Let the resolvent and its conjugates be

$$V_0 = a + 2b + 3c, \quad V_1 = c + 2a + 3b, \quad V_2 = b + 2c + 3a,$$
$$V_3 = b + 2a + 3c, \quad V_4 = c + 2b + 3a, \quad V_5 = a + 2c + 3b.$$

Then the polynomial with the values of the resolvent and its conjugates as roots is

$$U(X) = (X - t_0)(X - t_1)(X - t_2)(X - t_3)(X - t_4)(X - t_5)$$
$$= X^6 - e_1 X^5 + e_2 X^4 - e_3 X^3 + e_4 X^2 - e_5 X + e_6,$$

where $e_k$ denotes the elementary symmetric polynomial of degree $k$ in $t_0, \dots, t_5$; in particular $e_1 = t_0 + t_1 + \dots + t_5$ and $e_6 = t_0 t_1 t_2 t_3 t_4 t_5$.



If we expand the coefficients of $U$ in terms of the roots $a$, $b$ and $c$ we find that they are symmetric polynomials in the roots. For example the terms for $X^5$ and $X^4$ are

$$-12(a + b + c)X^5$$
$$\left[58(a^2 + b^2 + c^2) + 122(ab + ac + bc)\right]X^4.$$

Using the symmetric polynomials and elementary symmetric polynomials as defined in Section A.1 we have the following information:

$$\sigma_1 = \sigma_2 = 0, \quad \sigma_3 = 2$$
$$s_1 = s_2 = s_4 = s_5 = 0, \quad s_3 = 6, \quad s_6 = 12.$$

So our polynomial becomes

$$U(X) = X^6 + t_0 t_1 t_2 t_3 t_4 t_5 = X^6 + 108$$

which is irreducible over $\mathbb{Q}$.
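The result $X^6 + 108$ can be cross-checked numerically by multiplying out the six linear factors with floating-point values for the roots. This is only a sanity check under our own assumptions (the roots are taken as $a = \sqrt[3]{2}$, $b = ya$, $c = y^2 a$ with $y = e^{2\pi i/3}$, and the variable names are ours), not the symbolic calculation described in the text.

```python
import cmath
from itertools import permutations

z = 2 ** (1 / 3)                         # real cube root of 2
y = cmath.exp(2j * cmath.pi / 3)         # primitive cube root of unity
roots = [z, y * z, y * y * z]            # a, b, c

# Values t_i of the resolvent p + 2q + 3r over all six arrangements (p, q, r).
values = [p + 2 * q + 3 * r for p, q, r in permutations(roots)]

# Multiply out (X - t_0)...(X - t_5); coefficients are stored highest degree first.
coeffs = [1 + 0j]
for t in values:
    coeffs = [c - t * prev for c, prev in zip(coeffs + [0], [0] + coeffs)]

U = [round(c.real) for c in coeffs]
largest_imaginary_part = max(abs(c.imag) for c in coeffs)
print(U, largest_imaginary_part)
```

Rounding is safe here because the exact coefficients are integers and the accumulated floating-point error is many orders of magnitude below one half.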

In general the polynomial $F$ will not always be irreducible but will decompose into irreducible factors so that $F(X) = G_1(X)G_2(X)G_3(X) \dots G_s(X)$. Note that since the roots of $F$ are the conjugates of the resolvent then each of the factors must have some of the conjugates as its roots, and each factor has roots distinct from the roots of each other factor. We choose $G_1$ to be the factor that has $V_0 = t_0$ as a root.

A.1.2.3 Expressing roots in terms of the Galois Resolvent. We now move on to Galois’ Lemma III [ED, p103]:

Lemma 5: When the function $V$ is chosen as indicated above, it will have the property that all the roots of the given equation can be expressed as rational functions of $V$.

Proof: First, Galois considered the conjugates of the resolvent that held one, arbitrarily chosen, root fixed and permuted all the others. There are $(n-1)!$ of these conjugates and we use them to construct a polynomial in the same manner we used for $F$. $V_0$ is always one of these conjugates that holds a root fixed. Let the others be $V_l, V_{l+1}, \dots$ then

$$F_a(X) = (X - t_0)(X - t_l)(X - t_{l+1}) \dots (X - t_{l+s}).$$

$F_a$ is symmetric in the roots that have been permuted. It can be shown that the elementary symmetric polynomials in all the other roots can be expressed in terms of the fixed root and the known elementary symmetric polynomials in all the roots [ED, pp10-11]. So we can eliminate from $F_a$ all but the fixed root, and we can say

$$F_a(X) = F_a(X, \alpha_1) \quad \text{where } \alpha_1 \text{ is the fixed root.}$$

Replace $\alpha_1$ with the indeterminate $Y$ and substitute $t_0$ for $X$ so that we have $F_a(t_0, Y)$ and $f(Y)$ as polynomials in $Y$ with coefficients that are rational functions in $t_0$. We find the greatest common divisor $d(Y)$ of $F_a(t_0, Y)$ and $f(Y)$. Now $(Y - \alpha_1)$ divides $F_a(t_0, Y)$ and $f(Y)$, and $d(Y) = A(Y)F_a(t_0, Y) + B(Y)f(Y)$. Writing $d(Y) = \gamma(Y - \alpha_1)$ and setting $Y = 0$ gives

$$\gamma(-\alpha_1) = A(0)F_a(t_0, 0) + B(0)f(0)$$
$$\alpha_1 = h_1(t_0)$$

where $h_1$ is a rational function of $t_0$ with rational coefficients. We can perform a similar process for each of the roots in turn. If we take the conjugates of the resolvent that hold another of the roots fixed we form another equation, say $\alpha_2 = h_2(t_0)$. This is most clearly demonstrated by returning to the example $u(x) = x^3 - 2$ with roots $a, b, c$. We see that $V_0$ and $V_5$ leave $a$ fixed:

$$U_a = (X - t_0)(X - t_5) = X^2 - (t_0 + t_5)X + t_0 t_5.$$

Now $t_0 + t_5 = 2a + 5(b + c)$ and $t_0 t_5 = a^2 + 6(b^2 + c^2) + 5a(b + c) + 13bc$. If $\tau_i$ are the elementary symmetric polynomials in $b$ and $c$ then we have

$$\tau_1 = b + c = -a, \quad \tau_2 = bc = a^2, \quad b^2 + c^2 = \tau_1^2 - 2\tau_2 = -a^2.$$

So $U_a(X, a) = X^2 + 3aX + 3a^2$. The greatest common divisor of $U_a(t_0, Y)$ and $u(Y)$ is $d$ where

$$d(Y) = \frac{1}{3}\left(2t_0^2 Y + t_0^3 - 6\right).$$

Setting $d(Y) = \gamma(Y - \alpha_1)$ and $Y = 0$ we find $\gamma = \frac{2t_0^2}{3}$ and $a = h_1(t_0) = \frac{6 - t_0^3}{2t_0^2}$, and similarly we can show

$$b = h_2(t_0) = -\frac{6}{t_0^2}, \qquad c = h_3(t_0) = \frac{6 + t_0^3}{2t_0^2}.$$



□ 
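The example can be confirmed numerically. The sketch below assumes the roots are labelled $a = \sqrt[3]{2}$, $b = ya$, $c = y^2 a$ with $y = e^{2\pi i/3}$, and takes the three rational functions to be $h_1(t) = (6 - t^3)/(2t^2)$, $h_2(t) = -6/t^2$ and $h_3(t) = (6 + t^3)/(2t^2)$; the Python names are ours.

```python
import cmath

y = cmath.exp(2j * cmath.pi / 3)     # primitive cube root of unity
z = 2 ** (1 / 3)                     # real cube root of 2
a, b, c = z, y * z, y * y * z        # roots of u(x) = x^3 - 2

t0 = a + 2 * b + 3 * c               # value of V_0
t5 = a + 2 * c + 3 * b               # value of V_5, which also fixes a


def U_a(X):
    """U_a(X, a) = X^2 + 3aX + 3a^2, whose roots should be t_0 and t_5."""
    return X ** 2 + 3 * a * X + 3 * a ** 2


def h1(t):
    return (6 - t ** 3) / (2 * t ** 2)


def h2(t):
    return -6 / t ** 2


def h3(t):
    return (6 + t ** 3) / (2 * t ** 2)


errors = [abs(U_a(t0)), abs(U_a(t5)),
          abs(h1(t0) - a), abs(h2(t0) - b), abs(h3(t0) - c)]
print(max(errors))
```

All five residuals vanish to rounding error, so each root of $u$ is recovered from the single number $t_0$, as Lemma 5 asserts.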



Since the conjugates of the resolvent are obtained by a permutation of the roots of the given equation $f$ we should expect that the roots can be expressed in terms of all of these conjugates. We have Galois’ Lemma IV [ED, p104]:

Lemma 6:

Suppose one has formed the equation for $V$, and that one has taken one of its irreducible factors, so that $V$ is the root of an irreducible equation. Let $V, V', V'', \dots$ be the roots of this irreducible equation. If $a = f(V)$ is one of the roots of the given equation, $f(V')$ will also be a root of the given equation.



Proof: 

In the notation we are using we have

$$F(X) = G_1(X)G_2(X)G_3(X) \dots G_s(X)$$

where $t_0$ is a root of $G_1$ which has $\deg G_1$ roots taken from the $t_i$. So if we have $\alpha_1 = h_1(t_0)$ and then perform the permutation $\theta = (123 \dots n)$ we have $\alpha_2 = h_1(\theta(t_0))$.



□ 

Returning to our example, we have $a = h_1(t_0) = \frac{6 - t_0^3}{2t_0^2}$. The permutation $(132)$ takes $t_0$ to $t_1$ and $a$ to $c$, so $c = h_1(t_1) = \frac{6 - t_1^3}{2t_1^2}$.



A.1.3 The Galois Group

A.1.3.1 Finding the Galois Group. We come now to a fundamental part of Galois Theory. Any given equation has a particular group associated with it and in this section we show how Galois found this group. Firstly we quote Galois’ Proposition I [ED, p104]:

Theorem 7:

Let an equation be given whose $m$ roots are $a, b, c, \dots$. There will always be a group of permutations of the letters $a, b, c, \dots$ which will have the following property:

1. That each function invariant under the substitutions of this group will be known rationally,

2. Conversely, that every function of the roots which can be determined rationally will be invariant under these substitutions.

We have $F(X) = G_1(X)G_2(X)G_3(X) \dots G_s(X)$. Let $\deg G_1$ be $m$ so the roots of $G_1$ are $m$ of the $t_i$ including $t_0$. The given equation $f$ has $n$ roots that can all be represented as functions of the $t_i$. We can construct a table as below:





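$$\begin{array}{lllll}
h_1(t_0), & h_2(t_0), & h_3(t_0), & \dots & h_n(t_0) \\
h_1(t_1), & h_2(t_1), & h_3(t_1), & \dots & h_n(t_1) \\
\vdots & & & & \vdots \\
h_1(t_j), & h_2(t_j), & h_3(t_j), & \dots & h_n(t_j).
\end{array}$$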



This table has $m$ rows representing the $m$ conjugates of the resolvent that are roots of $G_1$, and $n$ columns representing the roots of the given equation $f$. This table gives us the Galois group since each row represents one arrangement of the roots. We can demonstrate this by using our example $u(x) = x^3 - 2$ once again. The table of functions of the $t_i$ is

$$\begin{array}{lll}
h_1(t_0), & h_2(t_0), & h_3(t_0) \\
h_1(t_1), & h_2(t_1), & h_3(t_1) \\
h_1(t_2), & h_2(t_2), & h_3(t_2) \\
h_1(t_3), & h_2(t_3), & h_3(t_3) \\
h_1(t_4), & h_2(t_4), & h_3(t_4) \\
h_1(t_5), & h_2(t_5), & h_3(t_5)
\end{array}$$

which gives

    a b c
    c a b
    b c a
    b a c
    c b a
    a c b.

These permutations are those contained in the whole of the symmetric group $S_3$. So the Galois group is isomorphic to $S_3$.
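The table of arrangements can be generated numerically by evaluating the three functions at each conjugate and identifying which root each value equals. The sketch below assumes $h_1(t) = (6 - t^3)/(2t^2)$, $h_2(t) = -6/t^2$, $h_3(t) = (6 + t^3)/(2t^2)$ and the labelling $a = \sqrt[3]{2}$, $b = ya$, $c = y^2 a$ with $y = e^{2\pi i/3}$; the helper `name_of` is our own.

```python
import cmath

y = cmath.exp(2j * cmath.pi / 3)
z = 2 ** (1 / 3)
root = {"a": z, "b": y * z, "c": y * y * z}

# Conjugates V_0, ..., V_5 written as the arrangement (p, q, r) in p + 2q + 3r.
arrangements = ["abc", "cab", "bca", "bac", "cba", "acb"]
t = [root[p] + 2 * root[q] + 3 * root[r] for p, q, r in arrangements]

h = [
    lambda v: (6 - v ** 3) / (2 * v ** 2),   # h_1
    lambda v: -6 / v ** 2,                   # h_2
    lambda v: (6 + v ** 3) / (2 * v ** 2),   # h_3
]


def name_of(value):
    """Return the label of the root of u closest to the given complex value."""
    return min(root, key=lambda label: abs(root[label] - value))


# Row i of the table is h_1(t_i), h_2(t_i), h_3(t_i), reported by root label.
table = ["".join(name_of(hk(ti)) for hk in h) for ti in t]
for row in table:
    print(" ".join(row))
```

Each row reproduces the arrangement that defined the corresponding conjugate, and the six rows are all distinct, which is the numerical counterpart of the statement that the group is the whole of $S_3$.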

Proof of Theorem 7:

This breaks down into several parts. We need to show:

i) the entries in the table are all roots of $f$;

ii) no root appears in a row more than once;

iii) points 1 and 2 regarding invariant and rational functions;

iv) closure of the group represented in the table.

i) $f(h_1(X))$ is a polynomial in $X$ with rational coefficients. Now $h_1(t_0) = \alpha_1$ so $f(h_1(t_0)) = 0$. This shows that $t_0$ is a root of $f(h_1(X))$. But $t_0$ is also a root of $G_1$, an irreducible equation. By Lemma 3 we must have $G_1$ divides $f(h_1)$. So every root of $G_1$ is also a root of $f(h_1)$. Similarly, every root of $G_1$ is a root of $f(h_2)$, $f(h_3)$, etc. So all the entries in the table are roots of $f$ [ED, p52].

ii) Let $\alpha_m$ and $\alpha_n$ be two distinct roots of $f$ so that $h_m(t_0) = \alpha_m$ and $h_n(t_0) = \alpha_n$. Suppose

$$h_m(t_i) - h_n(t_i) = 0$$

where $t_i$ is a root of $G_1$. Then $t_i$ is a root of $h_m(X) - h_n(X)$. So by Lemma 3 $G_1$ divides $h_m(X) - h_n(X)$ and $t_0$ is a root also. But then $h_m(t_0) = h_n(t_0)$, i.e. $\alpha_m = \alpha_n$, whereas the roots of $f$ are distinct. So we cannot have $h_m(t_i) = h_n(t_i)$ and the entries of each row of the table are distinct.

iii) Let $P(\alpha_1, \dots, \alpha_n)$ be a rational polynomial in the $n$ roots of $f$, such that $P$ is invariant under the permutations of the roots given by the table. Since each root can be expressed as a function of the Galois resolvent $t_0$ we have

$$P(\alpha_1, \dots, \alpha_n) = P(h_1(t_0), \dots, h_n(t_0)).$$

But since $P$ is invariant under these permutations we must have $P(h_1(t_0), \dots, h_n(t_0)) = P(h_1(t_i), \dots, h_n(t_i))$ for all roots $t_i$ of $G_1$.

Let $(h_1(t_i), \dots, h_n(t_i)) = h(t_i)$. So we can say

$$P(h(t_0)) = \frac{1}{m}\left[P(h(t_0)) + P(h(t_1)) + \dots\right] \quad \text{where } m = \deg G_1.$$

But this is a symmetric polynomial in the roots of $G_1$ and can therefore be expressed in terms of the rational coefficients of $G_1$. So

$$P(h(t_0)) \in \mathbb{Q} \implies P(\alpha_1, \dots, \alpha_n) \in \mathbb{Q}.$$

Conversely, assume that $P(\alpha_1, \dots, \alpha_n) \in \mathbb{Q}$ so that $P(h(t_0)) \in \mathbb{Q}$. Now $P(h(X))$ is a polynomial with rational coefficients so $P(h(X)) - P(h(t_0))$ is also a rational polynomial which has a root $t_0$. Therefore by Lemma 3 $P(h(X)) - P(h(t_0))$ is divisible by $G_1$ and has the roots of $G_1$ in common. But then $P(h(t_i)) = P(h(t_0))$ for any $t_i$ a root of $G_1$. So $P$ is invariant under every permutation of the roots of $f$ given by the table.

iv) To prove closure we show that no matter which row arrangement we begin with we obtain the same group of permutations. Let $s$ and $t$ be two different Galois resolvents for $f$, let $\phi$ be a permutation belonging to the Galois group of $f$ and let $\alpha_1, \alpha_2$ be the roots of $f$ such that $\alpha_2 = \phi(\alpha_1)$.

Suppose $\alpha_1 = h_s(s)$ and $\alpha_1 = h_t(t)$ then $\alpha_2 = \phi(h_s(s)) = h_s(\phi(s))$. We want to show that $\phi(h_t(t)) = \alpha_2$. Now

$$\begin{aligned}
h_s(s) &= h_t(t) \\
h_s(s) - h_t(t) &= 0, \qquad 0 \in \mathbb{Q} \\
\phi(h_s(s) - h_t(t)) &= 0 \\
h_s(\phi(s)) &= h_t(\phi(t)) \\
\alpha_2 &= h_t(\phi(t)).
\end{aligned}$$

Since the elements of the group are independent of the choice of Galois resolvent, they must also be independent of the choice of which conjugate of the resolvent we begin with.

□ 



A.1.3.2 Galois’ Examples of Groups. Galois highlights two examples of groups. The first is the general polynomial of degree $n$ where the coefficients are not explicitly known but are rational. In this case the only functions that can be determined rationally are the symmetric polynomials in the roots. Galois states, without proof, that the Galois group of a general polynomial of degree $n$ is the symmetric group $S_n$ containing $n!$ permutations of the roots. We delay discussion of this point until Section A.2, accepting for the moment the statement without proof.

The second example given is the cyclotomic equation of degree $n - 1$

$$\frac{x^n - 1}{x - 1} = x^{n-1} + x^{n-2} + \dots + x + 1 \quad \text{where } n \text{ is prime.}$$

Galois shows that the group for this polynomial consists of $n - 1$ permutations only. We will use this group as an example in Section A.3 which concentrates on soluble groups.

A.1.4 Soluble Equations and Soluble Groups

A.1.4.1 Adjoining Quantities to an Equation. At this point it is worth reviewing our progress so far. For our given equation $f$ we can find a Galois resolvent, say $V_0$, and form an equation $F$ of degree $n!$ such that

$$F(X) = \prod_{i=0}^{n!-1} (X - t_i) = G_1(X)G_2(X)G_3(X) \dots G_s(X)$$

where the $G_i$ are irreducible polynomials with rational coefficients and roots $t_i$. We choose $G_1$ to be the polynomial with one of its roots $t_0$. By expressing each root of $f$ as a function of $t_0$ we can construct a table which leads us to the Galois group of $f$. Importantly, the number of rows in the table, which is $\deg G_1$, is the order of the Galois group.

We now move on to the analysis of the Galois group. Galois was interested in how the group changed as certain quantities were adjoined to the given equation $f$. If we wish to express the roots of $f$ in radicals we will adjoin the required radicals, which will all be of smaller degree than $n$. In fact, we need only adjoin radicals of prime order: if we want the $j$th root where $j$ is not a prime, then we decompose $j$ into a product of primes, say $j = j_1 j_2 j_3 \dots$, and construct a $j_1$th root of a $j_2$th root of a $j_3$th root, etc. So we are only interested in adjoining $p$th roots where $p$ is a prime.

We are interested in adjoining sufficient quantities to the given equation $f$ to allow us to decompose it into linear factors. If this is possible then the equation is obviously soluble by radicals. We have Galois’ Proposition II [ED, p106]:

Theorem 8: 

If one adjoins to a given equation the root $r$ of an auxiliary irreducible equation of degree $p$,

1. Either the group of the equation will not be changed, or it will be partitioned into $j$ groups where $j$ divides $p$, each belonging to the given equation respectively when one adjoins each of the roots of the auxiliary equation; if $p$ is prime then the partition is into 1 or $p$ groups.

2. These groups will have the remarkable property that one will pass from 
one to the other in applying the same substitution of letters to all the 
permutations of the first. 

The auxiliary irreducible equation that Galois had in mind is of the form $x^p = k$, the solutions of which are the $p$th roots of $k$. Let these roots be $r, \alpha r, \alpha^2 r, \dots, \alpha^{p-1} r$ where $\alpha$ is a primitive $p$th root of unity. Gauss’ work had shown that $p$th roots of unity are expressible in radicals for any prime $p$ (although Edwards disagrees [ED, p27]). So Galois always assumed that $p$th roots of unity had already been adjoined to the given equation.

Proof: 

Galois does not give a full proof, but Edwards’ detailed proof [ED, pp59-61] is outlined here. If $G_1$ remains irreducible then the Galois group of the given equation $f$ remains unchanged. Otherwise, $G_1$ reduces to the product of irreducible factors $G_1(X) = H_1(X)H_2(X) \dots H_v(X)$. The new Galois group is of order $\deg H_1$ and consists of the arrangements of the roots of $f$ expressed as functions of those conjugates of $V_0$ that are roots of $H_1$. Again we must choose $H_1$ to be the factor of $G_1$ that has $V_0 = t_0$ as a root. Since we have adjoined to $f$ the root $r$ of $x^p = k$ we can represent $H_1$ as $H_1(X, r)$. Edwards shows that by substituting $\alpha^i r$ for $r$, $i = 1, \dots, p-1$, into $H_1$ we can form the product

$$h(X) = H_1(X, r)H_1(X, \alpha r) \dots H_1(X, \alpha^{p-1} r).$$

Since $G_1$ and $H_1$ have $\deg H_1$ roots in common and coefficients that are rational functions of $r$ then by Lemma 3 $G_1(X) = H_1(X, r)Q(X, r)$ where $Q$ is a polynomial with coefficients that are also rational functions of $r$. We can also show that

$$G_1(X) = H_1(X, \alpha^i r)Q(X, \alpha^i r) \quad \text{for } i = 1, \dots, p-1.$$

So we can say $G_1(X)^p = h(X)q(X)$ and by Lemma 3

$$\left[\frac{h(X)}{G_1(X)^j}\right]\left[\frac{q(X)}{G_1(X)^{p-j}}\right] = 1.$$

So $h(X) = \text{constant} \times G_1(X)^j$ for some integer $j$. This gives us $\deg h = p \deg H_1 = j \deg G_1$. The index of the new Galois group in the old Galois group is $\deg G_1 / \deg H_1$ and this, by the above equation, is $p/j$. If $p$ is prime then either $j = 1$ or $j = p$ so the index is 1 or $p$. Obviously, the value of 1 relates to $G_1$ remaining irreducible. To show the second part of Theorem 8 we first note that when $j$ is 1 we have

$$\text{constant} \times G_1(X) = H_1(X, r)H_1(X, \alpha r) \dots H_1(X, \alpha^{p-1} r).$$

The roots of $H_1(X, r)$ are a subset of the roots of $G_1(X)$, as are the roots of $H_1(X, \alpha r)$. Let two roots of $H_1(X, r)$ be $t_0$ and $t_1$ and a root of $H_1(X, \alpha r)$ be $t_2$. A row of the group table for $H_1$ is

$$\alpha_1 = h_1(t_0), \ \alpha_2 = h_2(t_0), \ \dots, \ \alpha_n = h_n(t_0).$$

If $S$ is a permutation belonging to the old Galois group and $S(t_0) = t_2$ then

$$S(\alpha_1) = h_1(t_2), \ S(\alpha_2) = h_2(t_2), \ \dots, \ S(\alpha_n) = h_n(t_2)$$

is a row of the table for $H_1(X, \alpha r)$. Edwards shows that if we apply $S$ to the row of the table which contains functions of $t_1$ then we obtain $h_1(t_3), h_2(t_3), \dots, h_n(t_3)$ where $t_3$ is also a root of $H_1(X, \alpha r)$. So by applying the same permutation we move from the new Galois group to another subset of the old Galois group, relating to another factor of $G_1$. A subgroup with this property is known as a normal subgroup. A normal subgroup partitions a group into cosets. In the above context these cosets are the subsets of the old Galois group relating to each irreducible factor $H_i$ of $G_1$. As a reminder, a subgroup $B$ of a group $A$ is normal if and only if for all $a \in A$ and $b \in B$ we have $a^{-1}ba \in B$.

The process of adjoining quantities is continued until the Galois group of $f$ has been reduced to the identity permutation only. At this stage we have adjoined all the necessary radicals to $f$ and $F$ has reduced to linear factors. Then the equation $f$ is soluble for its roots.

□ 

This shows that if $f$ is soluble by radicals then its Galois group will reduce by a series of subgroups, each a normal subgroup of prime index in its predecessor. Let the Galois group of $f$ be $B_0$. Ignoring any groups that are identical to their predecessors, we have the series

$$B_0 \rhd B_1 \rhd B_2 \rhd \dots \rhd B_v = e \quad \text{where } |B_{i-1}|/|B_i| = p \text{ with } p \text{ prime and } i = 1, \dots, v.$$

The prime $p$ may be different at each step. We call a group soluble if it has such a series of subgroups.

Consider our example $u(x) = x^3 - 2$. If we adjoin a primitive cube root of unity, $y$, then $U$ decomposes into two factors:

$$U(X) = \left(X^3 - 2(75y^2 + 69y + 72)\right)\left(X^3 - 2(69y^2 + 75y + 72)\right) = V_1(X)V_2(X).$$

The irreducible factor $V_1$ has Galois group $A_3$ which is the alternating group of degree 3. $A_3$ is a normal subgroup of $S_3$ of index 2. If we now adjoin $z$, the real cube root of 2, then $U$ can be expressed in linear factors since the roots of $u$ are $a = z$, $b = yz$, and $c = y^2 z$. This gives

$$U(X) = (X - (z + 2yz + 3y^2z))(X - (y^2z + 2z + 3yz))(X - (yz + 2y^2z + 3z))$$
$$\times\, (X - (yz + 2z + 3y^2z))(X - (y^2z + 2yz + 3z))(X - (z + 2y^2z + 3yz)).$$

So the series of subgroups for the Galois group of $u$ is $S_3 \rhd A_3 \rhd e$ where $e$ is a normal subgroup of $A_3$ with index 3.
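The decomposition can be checked numerically. The sketch below assumes that the two cubic factors read $X^3 - 2(75y^2 + 69y + 72)$ and $X^3 - 2(69y^2 + 75y + 72)$, that $y = e^{2\pi i/3}$, and that the roots are $a = z$, $b = yz$, $c = y^2 z$; the names `even` and `odd` for the two sets of conjugates are ours.

```python
import cmath

y = cmath.exp(2j * cmath.pi / 3)     # primitive cube root of unity
z = 2 ** (1 / 3)                     # real cube root of 2
a, b, c = z, y * z, y * y * z


def V1(X):
    return X ** 3 - 2 * (75 * y ** 2 + 69 * y + 72)


def V2(X):
    return X ** 3 - 2 * (69 * y ** 2 + 75 * y + 72)


# Conjugates coming from the even arrangements abc, cab, bca (t_0, t_1, t_2)
even = [a + 2 * b + 3 * c, c + 2 * a + 3 * b, b + 2 * c + 3 * a]
# Conjugates coming from the odd arrangements bac, cba, acb (t_3, t_4, t_5)
odd = [b + 2 * a + 3 * c, c + 2 * b + 3 * a, a + 2 * c + 3 * b]

residual_even = max(abs(V1(t)) for t in even)
residual_odd = max(abs(V2(t)) for t in odd)
product_gap = max(abs(V1(X) * V2(X) - (X ** 6 + 108))
                  for X in (0.5, 1 + 2j, -3))
print(residual_even, residual_odd, product_gap)
```

The three conjugates that are roots of $V_1$ correspond to the cyclic arrangements of $a, b, c$, which is how the group $A_3$ arises from this factor.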

Galois’ Propositions III and IV [ED, p107] can be dealt with fairly briefly. Proposition III states:

Theorem 9:

If one adjoins to an equation all the roots of an auxiliary equation, the groups in Theorem II [Theorem 8 here] will have the further property that each group contains the same substitutions.

Our choice of $r$, as a solution of the auxiliary equation, is completely arbitrary so we must necessarily obtain the same partitioning of the group for each root $r, r', r'', \dots$ of the auxiliary equation. If we adjoin all these roots then rather than splitting the Galois group of $f$ into cosets, each relating to one irreducible factor of $G_1$, we have the same normal subgroup of the Galois group applying to each factor.

Theorem 10: 

If one adjoins to an equation the numerical value of a certain function of 
its roots, the group of the equation will be reduced in such a way as to contain 
no permutations other than those which leave this function fixed. 

A.1.4.2 Finding Soluble Groups and Soluble Equations. Proposition V [ED, p108] asks the question which is the motivation behind Galois Theory: “In which case is an equation solvable by simple radicals?”

We know that the group of the equation must have a series of normal subgroups of index $p$ which relate to the adjunction of $p$th roots and the series must end with the identity permutation (a fuller discussion of soluble groups follows in Section A.3). Galois gives a rather compact proof of this [ED, p109]. Edwards expands this proof considerably, covering some points missed by Galois [ED, pp61-64].

Galois gives the example of the general quartic equation [ED, p109]. The group has 24 elements and the adjunction of a square root decomposes the group into two subsets of order 12. The new Galois group is $A_4$ which is a normal subgroup of $S_4$ of index 2. Then a cube root is adjoined and $A_4$ splits into 3 subsets. The new Galois group is $\{e, (12)(34), (13)(24), (14)(23)\}$ which is a normal subgroup of $A_4$ of index 3. We then adjoin another square root to give a Galois group $\{e, (12)(34)\}$ which is a normal subgroup of the previous group, of index 2. Finally we adjoin another square root to reduce the Galois group to the identity permutation. Therefore the general quartic is soluble by radicals.
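The chain of subgroups for the quartic is small enough to verify by brute force. In the sketch below permutations of four letters are stored as tuples of images on $\{0, 1, 2, 3\}$, so that $(12)(34)$ becomes `(1, 0, 3, 2)`; this encoding and the helper names are our own conventions.

```python
from itertools import permutations


def compose(p, q):
    """Return the permutation 'p after q' on {0, 1, 2, 3}."""
    return tuple(p[q[i]] for i in range(len(q)))


def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def is_even(p):
    """A permutation is even when it has an even number of inversions."""
    n = len(p)
    inversions = sum(p[i] > p[j] for i in range(n) for j in range(i + 1, n))
    return inversions % 2 == 0


def is_normal(sub, group):
    """Check that g^-1 h g lies in sub for every g in group and h in sub."""
    return all(compose(compose(inverse(g), h), g) in sub
               for g in group for h in sub)


e = (0, 1, 2, 3)
S4 = set(permutations(range(4)))
A4 = {p for p in S4 if is_even(p)}
V4 = {e, (1, 0, 3, 2), (2, 3, 0, 1), (3, 2, 1, 0)}   # e, (12)(34), (13)(24), (14)(23)
C2 = {e, (1, 0, 3, 2)}                               # e, (12)(34)

chain = [S4, A4, V4, C2, {e}]
indices = [len(big) // len(small) for big, small in zip(chain, chain[1:])]
normal = [is_normal(small, big) for big, small in zip(chain, chain[1:])]
print(indices, normal)
```

The indices 2, 3, 2, 2 match the square root, cube root, square root and square root adjoined in turn, and each subgroup is normal in its predecessor.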

More interesting are the general equations of degree $\geq 5$. Consider the general quintic equation. The Galois group is $S_5$ so we need to find whether $S_5$ is soluble. Here we quote a theorem which Edwards has derived from Galois’ memoir [ED, p91]:

Theorem 11:

If an irreducible equation has prime degree $p$ and is solvable by radicals, then the roots of the equation can be ordered $a_1, a_2, \dots, a_p$ in such a way that the permutations $s$ of the Galois group are all of the form $s(a_i) = a_{ri+s}$ where $a_i$ is defined for all integers $i$ by setting $a_i = a_j$ for $i \equiv j \pmod{p}$, where $r$ and $s$ are integers, and where $r \not\equiv 0 \pmod{p}$. The proof can be found in Edwards [ED, pp91-93].

If the general quintic equation is soluble by radicals then, by the above theorem, the symmetric group $S_5$ should contain only permutations of the form $s(a_i) = a_{ri+s}$. If we are working modulo 5 then there are 5 values of $s$ but only 4 values of $r$ that we can use. This gives 20 possible permutations but $S_5$ has 120 elements. Therefore the general quintic equation is not soluble by radicals [ED, p91]. This does not mean that every quintic is insoluble, but it is not possible to express the roots in terms of a formula in the coefficients of the equation using arithmetical operators and radicals.
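The count of 20 against 120 is easily reproduced. The sketch below enumerates the maps $i \mapsto ri + s$ modulo 5 as tuples of images; indexing the roots from 0 to 4 rather than 1 to 5 is our own convention and does not affect the count.

```python
from math import factorial

p = 5

# Each map i -> r*i + s (mod p) with r not congruent to 0, stored as a tuple of images.
affine = {tuple((r * i + s) % p for i in range(p))
          for r in range(1, p) for s in range(p)}

all_are_permutations = all(sorted(m) == list(range(p)) for m in affine)
print(len(affine), factorial(p), all_are_permutations)
```

Since the 20 maps are distinct permutations and $S_5$ has 120 elements, most permutations of five roots are not of the required form.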

Since $S_5$ is a subgroup of every symmetric group $S_n$ for $n \geq 5$, no symmetric group $S_n$ is soluble for $n \geq 5$ [ED, p90]. This means that there is no formula for the solution of general equations of degree greater than that of the general quartic.



A.2 The Modern Approach

A.2.1 Field Extension

A.2.1.1 Constructing Field Extensions. For this section we shall assume that the reader understands the basics of ring, field and group theory although some theorems will be quoted (without proof) to highlight their importance. A useful basic text is Allenby’s Rings, Fields and Groups [AL]. This text covers all the preparatory work for modern Galois Theory but stops short of the theory itself. We will also need to refer to irreducibility, multiple roots and symmetric polynomials, all of which were introduced in Section A.1.1.

In Section A.1 we considered polynomials with rational coefficients, i.e. polynomials over the field of rational numbers $\mathbb{Q}$. Modern algebra provides a framework for Galois theory in which it can be applied to a wider subject area than rational polynomials, although most of our examples in this section will still be rational polynomials.

Through the rest of this chapter we fix a general field $K$ (not necessarily $\mathbb{Q}$). However we need to know whether or not $K$ is “like” $\mathbb{Q}$ in one important sense. Recall

Theorem 1 [ST, pp3-4]

1. The prime subfield of a field $K$ is the intersection of all subfields of $K$.

2. Every prime subfield is isomorphic either to the field $\mathbb{Q}$ of rationals or the field $\mathbb{Z}_p$ of integers modulo a prime number $p$.

3. If the prime subfield of $K$ is isomorphic to $\mathbb{Q}$ we say $K$ has characteristic 0. If the prime subfield of $K$ is isomorphic to $\mathbb{Z}_p$ we say $K$ has characteristic $p$.

4. If $K$ is a subfield of $L$ then $K$ and $L$ have the same characteristic.

Most of the examples we will see in this section have characteristic 0 and most 
of the theory is independent of the characteristic. However, there are some 
theorems that are sensitive to the characteristic and these will be highlighted 
where they occur. 

Galois used the concept of adjoining roots to a given equation and this translates easily into the theory of field extensions. If $f(x) = (x^2 + 1)(x^2 - 5)$ then $f$ is a polynomial over $\mathbb{Q}$ since its coefficients are in $\mathbb{Q}$. We say the factors of $f$ are “irreducible over $\mathbb{Q}$” if there are no elements in $\mathbb{Q}$ that would allow us to reduce $f$ any further. If we “extend” $\mathbb{Q}$ to include $i = \sqrt{-1}$ we denote the field generated (as a subfield of $\mathbb{C}$ say) as $\mathbb{Q}(i)$, which reads “$\mathbb{Q}$ adjoined with $i$”. Then

$$f(x) = (x + i)(x - i)(x^2 - 5)$$

as a polynomial over $\mathbb{Q}(i)$. Similarly

$$f(x) = (x + i)(x - i)(x - \sqrt{5})(x + \sqrt{5})$$

as a polynomial over $\mathbb{Q}(i, \sqrt{5})$. We denote this as a field extension by $\mathbb{Q}(i, \sqrt{5}) : \mathbb{Q}$ if we want to make clear what the “ground” field is, being the one we started with. We can classify extensions with

Definition 2 [ST, pp33-34]

1. A simple extension is an extension $L : K$ having the property that $L = K(\alpha)$ for some $\alpha \in L$.

2. Let $K(\alpha) : K$ be a simple extension. If there exists a non-zero polynomial $p$ over $K$ such that $p(\alpha) = 0$ then $\alpha$ is an algebraic element over $K$ and the extension is called a simple algebraic extension. Otherwise $\alpha$ is transcendental over $K$ and $K(\alpha) : K$ is called a simple transcendental extension.

For example, the polynomial $f(x) = a_n x^n + \dots + a_1 x + a_0$ is an element of a simple transcendental extension of $\mathbb{Q}$, that is, the field $\mathbb{Q}(x)$ of all rational expressions in $x$. If there is a polynomial $p$ over $\mathbb{Q}$ such that $p(x) = 0$ then $p = 0$.

Field extensions are an important part of Galois Theory since the way we 
build a field extension, to allow us to solve an equation for its roots, is inti- 
mately related to the structure of the Galois group of the equation. 

Theorem 3 [ST, p35,45,47] 

1. Let L : K be a field extension and suppose that $\alpha \in L$ is algebraic over K. 
Then the minimum polynomial of $\alpha$ over K is the unique monic polynomial 
$m$ over K of smallest degree such that $m(\alpha) = 0$. 

2. The degree of an extension L : K is denoted [L : K]. The degree of a field 
extension L : K is the dimension of L considered as a vector space over K. 

3. If K, L, M are fields and $K \subseteq L \subseteq M$ then [M : K] = [M : L][L : K]. 

4. Let $K(\alpha) : K$ be a simple extension. If it is transcendental then 
$[K(\alpha) : K] = \infty$. If it is algebraic then $[K(\alpha) : K] = \deg m$ 
where $m$ is the minimum polynomial of $\alpha$ over K. 

Examples 

1. Consider C : R. Every element of C is of the form $x + iy$ where $x, y \in R$, 
so we can construct C by adjoining $i$ to R. Then $\{1, i\}$ forms a basis 
for C over R so [C : R] = 2. 

2. The extension $K(x) : K$ where $x$ is transcendental over K has infinite 
degree since the elements $\{1, x, x^2, \dots\}$ are linearly independent over K 
(they form a basis for the polynomial ring $K[x]$, which lies inside $K(x)$). 

Finally in this section we have the following: 

Lemma 4 [ST, pp47-48] 

1. A finite extension is one whose degree is finite. 

2. An extension L : K is algebraic if every element of L is algebraic over K. 

3. L : K is a finite extension if and only if L is algebraic over K and there 
exist finitely many elements $\alpha_1, \dots, \alpha_s \in L$ such that $L = K(\alpha_1, \dots, \alpha_s)$. 

A.2.1.2 Splitting Fields. Although formal field theory did not exist in Galois’ 
time he would have had an intuitive notion of fields. When he wrote “. . . a pth 
root of unity, $\alpha$, is included among the quantities that have already been 
adjoined to the equation.” [ED, p108], this would be in modern terminology $Q(\alpha)$. 
The concept of a splitting field is quite simple. We construct field extensions 
over our ground field until we are able to express the given equation in linear 
factors and, therefore, find the roots of the equation. The splitting field will 
contain all the roots of $f$. 

For example, in Section A.1 we found that the roots of $u(x) = x^3 - 2$ were 
$z$, $yz$, $y^2 z$ where $y$ is a primitive cube root of unity and $z$ the real cube root 
of 2. We first construct $Q(y) : Q$ but find that $u$ does not completely reduce. 
We then construct $Q(y, z) : Q(y)$ so that $u(x) = (x - z)(x - yz)(x - y^2 z)$. Our 
splitting field for $u$ is $Q(y, z)$. 

Theorem 5 [ST, pp78-79]: The field $\Sigma$ is a splitting field for the polynomial 
$f$ over the field K if $K \subseteq \Sigma$ and 

1. $f$ splits into linear factors over $\Sigma$. 

2. $\Sigma = K(\alpha_1, \dots, \alpha_n)$ where $\alpha_1, \dots, \alpha_n$ are the roots of $f$ in $\Sigma$. 

In the example $u(x) = x^3 - 2$, if we construct the field $Q(z)$ first then one root 
of $u$ lies in $Q(z)$. But $Q(z) : Q$ is not a normal field extension as the other 
roots of $u$ are not in $Q(z)$ and $u$ does not split over $Q(z)$. 

Definition 7 [ST, p83]: 

An irreducible polynomial $f$ over a field K is separable over K if it has no 
multiple roots in a splitting field. This means that in any splitting field $f$ takes 
the form 

$f(x) = k(x - \alpha_1) \dots (x - \alpha_n)$ 

where the $\alpha_i$ are all different. An irreducible polynomial over a field K is 
inseparable over K if it is not separable over K. For example, let $K_0 = Z_p$ 
where p is prime and let $K = K_0(u)$ where $u$ is transcendental over $K_0$. If 

$f(x) = x^p - u \in K[x]$ 

then let L be a splitting field for $f$ over K. Let $t$ be a root of $f$ in L so that 
$t^p = u$. We have 

$(x - t)^p = x^p + \binom{p}{1} x^{p-1}(-t) + \dots + (-t)^p.$ 

But p divides $\binom{p}{r}$ for $0 < r < p$ and any multiple of p in K is zero. So 

$(x - t)^p = x^p - t^p = x^p - u = f(x).$ 

If $a^p - u = 0$ then $(a - t)^p = a^p - t^p = u - u = 0$, so $a = t$ and all the roots of $f$ in L are 
equal [ST, p85]. 
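
The step that makes this example work is that p divides every binomial coefficient $\binom{p}{r}$ with $0 < r < p$. The following minimal Python sketch checks this numerically for a few exponents; the function name `middle_binomials_vanish` is our own illustrative choice and does not come from [ST]. It also shows that the property fails when the exponent is composite, which is why the argument needs p to be prime.

```python
from math import comb


def middle_binomials_vanish(n: int) -> bool:
    """Return True if n divides C(n, r) for every r with 0 < r < n."""
    return all(comb(n, r) % n == 0 for r in range(1, n))


if __name__ == "__main__":
    # Primes first, then some composite exponents for contrast.
    for n in (2, 3, 5, 7, 11, 4, 6, 9):
        print(n, middle_binomials_vanish(n))
```

For a prime exponent the check succeeds, so $(x - t)^p$ collapses to $x^p - t^p$ in characteristic p; for an exponent such as 4 the middle coefficient $\binom{4}{2} = 6$ is not a multiple of 4.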

In Section A.1 we showed that any irreducible polynomial with rational 
coefficients had only simple roots. This result holds for any irreducible polynomial 
over a field K where K has characteristic 0. However the result is different for 
fields of characteristic p > 0. 

Proposition 8 [ST, p86]: 

If K is a field of characteristic 0 then every irreducible polynomial over 
K is separable over K. If K has characteristic p > 0 then an irreducible 
polynomial $f$ over K is inseparable if and only if 

$f(x) = k_r x^{rp} + \dots + k_1 x^p + k_0, \qquad k_0, \dots, k_r \in K.$ 

If $f$ is defined as above then 

$Df = rp k_r x^{rp-1} + (r - 1)p k_{r-1} x^{(r-1)p-1} + \dots + p k_1 x^{p-1}.$ 

Since every multiple of p is zero in a field of characteristic p > 0, every term 
of this derivative vanishes, so the result $Df = 0$ does not imply that $f$ is a constant. 
However, if $f$ has terms that are 
not raised to a power that is a multiple of p then $Df$ will contain terms which 
are identically zero only when their coefficient is zero. 

The last result in this section is 

Lemma 9 [ST, p86] 

Let L : K be a separable algebraic extension and let M be an intermediate 
field. Then M : K and L : M are separable. 



A.2.2 The Galois Group 

A.2.2.1 Automorphisms. If we are able to construct a finite separable normal 
extension for a given equation we can relate the Galois group of the equation to 
the field extension through the operation of mappings on the field extension. We 
can define a mapping from one field to another that satisfies certain conditions. 
If $\theta : K \to L$ is such a map from the field K to the field L and $x, y \in K$ then 

$\theta(x + y) = \theta(x) + \theta(y)$ 

$\theta(xy) = \theta(x)\theta(y).$ 

We shall be concerned with automorphisms, which are bijective maps of this type 
from a field to itself. We are 
particularly interested in automorphisms which fix our ground field K, which 
are defined as follows. 

Theorem 10 [ST, p72] 

1. If K and L are two fields with $K \subseteq L$ and $\theta : L \to L$ is an automorphism 
such that $\theta(k) = k$ for all $k \in K$ then $\theta$ is a K-automorphism of L. 

2. The set of all K-automorphisms of L forms a group under composition of 
maps. 

3. The Galois group $\Gamma(L : K)$ of the extension L : K is the group of all 
K-automorphisms of L under composition of maps. 

Thus we have a connection between a field extension and its Galois group. 
Proof (of (2)) 

If $\alpha$ and $\beta$ are K-automorphisms of L then $\alpha\beta$ is also an automorphism. 
For $k \in K$ 

$\alpha\beta(k) = \alpha(k) = k$, so $\alpha\beta$ is a K-automorphism. 

If $e$ is the identity then $e$ is a K-automorphism since $e(k) = k$. 

$\alpha^{-1}$ is an automorphism of L and $k = \alpha^{-1}\alpha(k) = \alpha^{-1}(k)$, so $\alpha^{-1}$ 
is a K-automorphism of L. 

□ 

Let us return again to the example $u(x) = x^3 - 2$. Suppose the field extension 
is $Q(y, z) : Q$; then we have the following possible Q-automorphisms of $Q(y, z)$. 

$\theta : \quad \theta(y) = y, \quad \theta(z) = yz, \quad \theta^3 = e$ 

$\phi : \quad \phi(y) = y^2, \quad \phi(z) = z, \quad \phi^2 = e$ 

We can construct the following table which gives all possible Q-automorphisms 
of $Q(y, z)$: 

Q-automorphism                   Effect on y     Effect on z 
$e$                              $y$             $z$ 
$\theta$                         $y$             $yz$ 
$\theta^2$                       $y$             $y^2 z$ 
$\phi$                           $y^2$           $z$ 
$\theta\phi = \phi\theta^2$      $y^2$           $yz$ 
$\theta^2\phi = \phi\theta$      $y^2$           $y^2 z$ 

So $\Gamma(Q(y, z) : Q) = \{e, \theta, \theta^2, \phi, \theta\phi, \theta^2\phi\}$. This group is isomorphic to $S_3$, which 
is the Galois group of $u$ that we found in Section A.1. 
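
The table can be checked mechanically. In the sketch below, which is only an illustration under our own encoding and not a construction from [ST], an automorphism is stored as a pair `(a, b)` meaning $y \mapsto y^a$ and $z \mapsto y^b z$, with exponents taken modulo 3 because $y^3 = 1$. The names `compose` and `generate` are hypothetical helpers.

```python
# Encoding (an assumption of this sketch): (a, b) stands for the automorphism
# y -> y**a, z -> y**b * z, where y is a primitive cube root of unity.
E = (1, 0)      # identity
THETA = (1, 1)  # y -> y,    z -> y*z
PHI = (2, 0)    # y -> y**2, z -> z


def compose(s, t):
    """Return the composite s∘t, i.e. apply t first and then s."""
    a_s, b_s = s
    a_t, b_t = t
    # t sends z to y**b_t * z; s then sends y to y**a_s and z to y**b_s * z.
    return (a_s * a_t % 3, (a_s * b_t + b_s) % 3)


def generate(generators):
    """Close {E} under right multiplication by the generators."""
    group = {E}
    frontier = [E]
    while frontier:
        g = frontier.pop()
        for h in generators:
            gh = compose(g, h)
            if gh not in group:
                group.add(gh)
                frontier.append(gh)
    return group


GROUP = generate([THETA, PHI])

if __name__ == "__main__":
    print(sorted(GROUP))
    print("theta*phi =", compose(THETA, PHI), " phi*theta =", compose(PHI, THETA))
```

The generated set has six elements and $\theta\phi \neq \phi\theta$, so the group is non-abelian of order 6, consistent with it being isomorphic to $S_3$.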

A.2.2.2 Subfields and Subgroups. The elements of the Galois group relate to 
the splitting field by defining subfields. Each subfield has its own automorphisms 
of the larger field, namely those that fix the subfield, and these also form a group. 
The automorphisms of a splitting 
field are generally the permutations of roots such as we considered in Section 
A.1.2. Subfields may be considered to be intermediate stages in the adjunction 
of roots although the process is somewhat different in this context. 

Definition 11 [ST, pp73-75] 

If L : K is a field extension let M be an “intermediate” field such that 
$K \subseteq M \subseteq L$. To each intermediate field M we associate a group $M^* = \Gamma(L : M)$ which 
contains all the M-automorphisms of the field L. $\Gamma(L : M)$ is a subgroup of the 
Galois group $\Gamma(L : K)$. Now $K^*$ is the group of all K-automorphisms of L 
and is therefore the whole Galois group, whilst $L^*$ contains only the identity 
automorphism. If $M \subseteq N$ then $M^* \supseteq N^*$ since any automorphism fixing the 
elements of N will fix the elements of M. 

Working from the other direction, given a subgroup of $\Gamma(L : K)$ we can 
associate with it a set of fixed elements of the field L. If H is a subgroup of 
$\Gamma(L : K)$ then $H^\dagger$ denotes the set of all elements in L fixed by the automorphisms 
in H. 

The relationship between the subgroups and subfields is given by 

Lemma 12 [ST, p74] 

If H is a subgroup of $\Gamma(L : K)$ then $H^\dagger$ is a subfield of L containing K. We 
call $H^\dagger$ the fixed field of H. If $H \subseteq G$ then $H^\dagger \supseteq G^\dagger$ since any elements fixed by G 
will be fixed by H. If M is an intermediate field and H is a subgroup of the 
Galois group then $M \subseteq M^{*\dagger}$ and $H \subseteq H^{\dagger *}$. 

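The example $u(x) = x^3 - 2$ has the following fixed fields and related 
subgroups: 

Subgroup of $\Gamma(Q(y, z) : Q)$                                Fixed field 
$\{e\}$                                                          $Q(y, z)$ 
$\{e, \theta^2\phi\}$                                            $Q(yz)$ 
$\{e, \theta\phi\}$                                              $Q(y^2 z)$ 
$\{e, \phi\}$                                                    $Q(z)$ 
$\{e, \theta, \theta^2\}$                                        $Q(y)$ 
$\{e, \theta, \theta^2, \phi, \theta\phi, \theta^2\phi\}$        $Q$ 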



A more complicated example is the field extension $Q(\sqrt{2}, \sqrt{3}, \sqrt{5}) : Q$. Let this 
be L : Q. Let 

$r_1 = \sqrt{2}, \quad r_2 = -\sqrt{2}, \quad r_3 = \sqrt{3}, \quad r_4 = -\sqrt{3}, \quad r_5 = \sqrt{5}, \quad r_6 = -\sqrt{5}$ 

then the Q-automorphisms can be represented as permutations on the $r_i$. If $\theta$ 
is a permutation then $\theta(r_i) = r_{\theta(i)}$. Our Galois group is then 

$\Gamma(L : Q) = \{e, (12), (34), (56), (12)(34), (12)(56), (34)(56), (12)(34)(56)\}.$ 

Some proper subgroups of $\Gamma(L : Q)$ are 

$B = \{e, (12)(34), (12)(56), (34)(56)\}$, $C = \{e, (12), (34), (12)(34)\}$, 

$D = \{e, (12), (56), (12)(56)\}$, $E = \{e, (34), (56), (34)(56)\}$, 

$F = \{e, (12)(34)\}$, $G = \{e, (12)(56)\}$, $H = \{e, (34)(56)\}$, 

$R = \{e, (12)\}$, $S = \{e, (34)\}$, $T = \{e, (56)\}$, $U = \{e, (12)(34)(56)\}$. 

The fixed fields associated with these subgroups are 

$B^\dagger = Q(\sqrt{30})$, $C^\dagger = Q(\sqrt{5})$, $D^\dagger = Q(\sqrt{3})$, $E^\dagger = Q(\sqrt{2})$, 

$F^\dagger = Q(\sqrt{5}, \sqrt{6})$, $G^\dagger = Q(\sqrt{3}, \sqrt{10})$, $H^\dagger = Q(\sqrt{2}, \sqrt{15})$, 

$R^\dagger = Q(\sqrt{3}, \sqrt{5})$, $S^\dagger = Q(\sqrt{2}, \sqrt{5})$, $T^\dagger = Q(\sqrt{2}, \sqrt{3})$, 

$U^\dagger = Q(\sqrt{6}, \sqrt{10}, \sqrt{15})$. 

We also have $\{e\}^\dagger = L$ and $\Gamma(L : Q)^\dagger = Q$. 

In summary we have 

Definition 13 [ST, p74] 

If $\mathcal{F}$ denotes the set of intermediate fields, and $\mathcal{G}$ the set of subgroups of 
the Galois group then we have two maps 

$* : \mathcal{F} \to \mathcal{G}$ 
$\dagger : \mathcal{G} \to \mathcal{F}$ 

which reverse inclusions and satisfy Lemma 12. 

A.2.2.3 Group Order and Degree of Field Extension. There is also a 
relationship between the degree of a field extension and the order of the group of 
automorphisms associated with the extension. 

Theorem 14 [ST, p91] 

1. Let G be a finite subgroup of the group of automorphisms of a field K, and 
let $K_0$ be the fixed field of G. Then $[K : K_0] = |G|$. 

2. If G is the Galois group of the finite extension L : K and H is a finite 
subgroup of G then $[H^\dagger : K] = [L : K]/|H|$. 

Consider the example of $Q(\sqrt{2}, \sqrt{3}, \sqrt{5}) : Q = L : Q$ detailed above. 

$[L : Q] = [L : Q(\sqrt{2}, \sqrt{3})][Q(\sqrt{2}, \sqrt{3}) : Q(\sqrt{2})][Q(\sqrt{2}) : Q] = 2 \cdot 2 \cdot 2 = 8.$ 

Let $Z = \Gamma(L : Q)$; then $|Z| = 8$. We have $Z \supseteq S = \{e, (34)\}$ and $|S| = 2$. Now 
the fixed field of S is $S^\dagger = Q(\sqrt{2}, \sqrt{5})$ and 
$[S^\dagger : Q] = [Q(\sqrt{2}, \sqrt{5}) : Q(\sqrt{2})][Q(\sqrt{2}) : Q] = 2 \cdot 2 = 4$. 
So $4 = [S^\dagger : Q] = [L : Q]/|S| = 8/2 = 4$. 

Another example is supplied by the field extension for $u(x) = x^3 - 2$. 

Let $G = \Gamma(Q(y, z) : Q)$ and $H = \{e, \theta, \theta^2\}$ so $|H| = 3$. 

Also $[Q(y, z) : Q] = [Q(y, z) : Q(y)][Q(y) : Q] = 3 \cdot 2 = 6$. 

The fixed field of H is $H^\dagger = Q(y)$, with $[H^\dagger : Q] = 2$. So 

$2 = [H^\dagger : Q] = [Q(y, z) : Q]/|H| = 6/3 = 2.$ 
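
For the extension $Q(\sqrt{2}, \sqrt{3}, \sqrt{5}) : Q$ the relation between subgroup order and fixed-field degree can be tabulated by a short program. The sketch below rests on two assumptions of ours: each automorphism is recorded as a triple of signs (whether it negates $\sqrt{2}$, $\sqrt{3}$, $\sqrt{5}$), and L is given the basis $\sqrt{d}$ for $d \in \{1, 2, 3, 5, 6, 10, 15, 30\}$. Every automorphism multiplies each basis element by $\pm 1$, so the fixed field of a subgroup is spanned by the basis elements that all of its members leave alone. The helper names are hypothetical.

```python
from itertools import product

PRIMES = (2, 3, 5)
# Basis of L over Q: sqrt(d) for each product d of distinct primes from PRIMES.
BASIS = [1, 2, 3, 5, 6, 10, 15, 30]

# An automorphism is a sign triple (s2, s3, s5): sqrt(p) -> s_p * sqrt(p).
IDENTITY = (1, 1, 1)
GROUP = list(product((1, -1), repeat=3))


def sign_on(auto, d):
    """Sign by which the automorphism multiplies the basis element sqrt(d)."""
    sign = 1
    for s, p in zip(auto, PRIMES):
        if d % p == 0:
            sign *= s
    return sign


def fixed_basis(subgroup):
    """Values d for which sqrt(d) is fixed by every member of the subgroup."""
    return [d for d in BASIS if all(sign_on(a, d) == 1 for a in subgroup)]


# A few of the subgroups named in the text, in sign-triple form:
# (12) negates sqrt(2), (34) negates sqrt(3), (56) negates sqrt(5).
SUBGROUPS = {
    "B": [IDENTITY, (-1, -1, 1), (-1, 1, -1), (1, -1, -1)],
    "S": [IDENTITY, (1, -1, 1)],
    "U": [IDENTITY, (-1, -1, -1)],
    "Z": GROUP,
}

if __name__ == "__main__":
    for name, members in SUBGROUPS.items():
        basis = fixed_basis(members)
        print(name, "order", len(members), "fixed basis", basis,
              "degree", len(basis))
```

In each case the number of fixed basis elements, which is the degree of the fixed field over Q, equals $8/|H|$ as Theorem 14 predicts.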







In the above two examples the degree of the field extension that gives the 
splitting field is equal to the order of the Galois group of the extension. This 
is formally stated with 

Theorem 15 [ST, p100] 

If L : K is a finite separable normal extension of degree n then there are 
precisely n distinct K-automorphisms of L, so that $|\Gamma(L : K)| = n$. 

Suppose that a separable extension L : K is not normal. It can be shown 
[ED, pp98-102] that a ‘normal closure’ that enlarges the extension L : K can 
be constructed, and that Theorem 15 will then hold for the enlarged extension. A normal closure 
is simply an extension N of L such that N : K is normal, and N is the smallest 
such extension of L which is normal over K. 



A.2.3 Applying Galois Theory 

A.2.3.1 The Galois Correspondence. We can now give a formal statement of 
the relationship between a field extension and its Galois group. This is known 
as the Galois correspondence and is the fundamental theorem of Galois theory 
in its modern algebra setting. 

Theorem 16 [ST, pp104-105] 

If L : K is a finite separable normal field extension of degree n, with Galois 
group G and if $\mathcal{F}$, $\mathcal{G}$, $*$, $\dagger$ are defined as in Section A.2.2 then 

1. The Galois group has order n. 

2. The maps $*$ and $\dagger$ are mutual inverses and set up an order reversing 1-1 
correspondence between $\mathcal{F}$ and $\mathcal{G}$. 

3. If M is an intermediate field then $[L : M] = |M^*|$ and $[M : K] = |G|/|M^*|$. 

4. An intermediate field M is a normal extension of K if and only if $M^*$ is a 
normal subgroup of G. 

5. If an intermediate field M is a normal extension of K then the Galois group 
of M : K is isomorphic to the quotient group $G/M^*$. 

We can illustrate the Galois correspondence for $Q(\sqrt{2}, \sqrt{3}, \sqrt{5}) : Q = L : Q$ 
as follows (refer to Section A.2.2 for the details): 

1. $[L : Q] = 8 = |Z|$. 

2. $L = \{e\}^\dagger$, $Q(\sqrt{2}, \sqrt{3}) = T^\dagger$, $Q(\sqrt{2}) = E^\dagger$, $Q = Z^\dagger$, $\{e\} = L^*$, 
$T = Q(\sqrt{2}, \sqrt{3})^*$, $E = Q(\sqrt{2})^*$, $Z = Q^*$. 

3. $[L : Q(\sqrt{2}, \sqrt{3})] = 2 = |T|$, $[Q(\sqrt{2}, \sqrt{3}) : Q] = 4 = |Z|/|T|$, 
$[L : Q(\sqrt{2})] = 4 = |E|$, $[Q(\sqrt{2}) : Q] = 2 = |Z|/|E|$. 

4. Z is abelian so every subgroup of Z is normal. 
Therefore $Q(\sqrt{2}) : Q$, $Q(\sqrt{2}, \sqrt{3}) : Q(\sqrt{2})$ and $L : Q(\sqrt{2}, \sqrt{3})$ are all 
normal extensions. 

5. Let $V = \{(12), (12)(34), (12)(56), (12)(34)(56)\}$. Then $Z/E = \{E, V\}$ 
which is isomorphic to $\Gamma(Q(\sqrt{2}) : Q) \cong R = \{e, (12)\}$. Let 

$W = \{(12), (12)(56)\}$, $X = \{(34), (34)(56)\}$, $Y = \{(12)(34), (12)(34)(56)\}$ 

then $Z/T = \{T, W, X, Y\}$ is isomorphic to $\Gamma(Q(\sqrt{2}, \sqrt{3}) : Q) \cong C = 
\{e, (12), (34), (12)(34)\}$. 

The field extension for $u(x) = x^3 - 2$ also satisfies the Galois correspondence, 
with $Q(y)$ being an intermediate field. $Q(y)^* = \{e, \theta, \theta^2\}$ and is a 
normal subgroup of $S_3$. 

A.2.3.2 Radical Extensions. Although field extensions are the ‘adjunction of 
quantities’ that Galois proposed, the modern expression of Galois Theory 
relies on starting with a splitting field and analysing the behaviour of the Galois 
group as elements of the splitting field are fixed. In practice, as we are unaware 
of the elements in the splitting field, we rely entirely on studying the nature of 
this field and its associated Galois group. That is, ensuring that our splitting 
field satisfies the conditions in Theorem 16 and analysing the solubility of the 
group. However, we have not yet imposed any conditions on the type of 
elements by which we extend the ground field. If we wish to show that a 
polynomial can be solved, then we must have a “radical extension”. 

Definition 17 [ST, p128] 

An extension L : K is radical if $L = K(\alpha_1, \dots, \alpha_m)$ where for each 
$i = 1, \dots, m$ there exists an integer $n(i)$ such that 

$\alpha_i^{n(i)} \in K(\alpha_1, \dots, \alpha_{i-1}) \qquad (i \geq 2)$ 

and $\alpha_1^{n(1)} \in K$. The elements $\alpha_i$ are said to form a radical sequence for L : K. 

For example if $L = Q(\sqrt{2}, \sqrt{3})$ then $(\sqrt{2})^2 = 2 \in Q$ and $(\sqrt{3})^2 = 3 \in Q(\sqrt{2})$. 

So a radical extension is nothing more than a series of adjunctions of pth 
roots as detailed by Galois. It is at this point we concentrate on fields of 
characteristic 0 since the proofs of some of the following items contain this assumption. 

Theorem 18 [ST, pp129-130] 

1. If L : K is a radical extension and M is a normal closure of L : K then 
M : K is radical. 

2. If K has characteristic 0 and L : K is normal and radical then $\Gamma(L : K)$ is 
soluble. 

3. If K is a field of characteristic 0 and $K \subseteq L \subseteq M$ where M : K is a radical 
extension, then the Galois group of L : K is a soluble group. 

Consider $Q(\sqrt{2}, \sqrt{3}, \sqrt{5}) : Q$. This is, by definition, a radical extension. 
Therefore we should expect its Galois group, Z, to be soluble. This is confirmed in 
Section A.1.4. 

$Q(y, z) : Q$ is less obvious. We know that $y$ and $z$ are expressible in 
radicals so the splitting field must be a radical extension. In general, however, 
we may not know whether an extension is radical. In such a case we can turn 
to a theorem which is almost the converse of Theorem 18. 

Theorem 19 [ST, p146] 

Let K be a field of characteristic 0 and let L : K be a finite normal extension 
with soluble Galois group G. Then there exists an extension R of L such that 
R : K is radical. 

So, since we have already shown that $S_3$ is soluble, we can state that the 
extension $Q(y, z) : Q$ is radical and $u$ is soluble by radicals. 

For fields of characteristic p > 0 we must impose an additional condition to 
ensure that a soluble group implies an equation soluble by radicals. In addition to the 
adjoining of radical elements as defined in Definition 17 we must also adjoin 
elements $a$ such that $a^p - a$ is in the given field, where p is the characteristic. The 
adjunction of such elements extends the proof of Theorem 19 to allow inclusion 
of fields of characteristic p > 0. Otherwise, we may have some polynomials with 
soluble Galois group that are not soluble by radicals. 

A.2.3.3 The General Polynomial of degree n. We have so far considered only 
finite algebraic extensions. To tackle the general polynomial equation 

$g(x) = x^n - \sigma_1 x^{n-1} + \dots + (-1)^n \sigma_n$ 

(where the coefficients are written in terms of the elementary symmetric 
polynomials $\sigma_k$) we will need to use a transcendental extension. 

Definition 20 [ST, p139] 

1. An extension L : K is finitely generated if $L = K(\alpha_1, \dots, \alpha_n)$ where n is 
finite. This is true whether the $\alpha_i$ are algebraic or transcendental over K. 

2. If $t_1, \dots, t_n$ are transcendental elements over a field K, all lying inside 
some extension L of K, then they are independent if there is no non-trivial 
polynomial $p$ over K (in n indeterminates) such that $p(t_1, \dots, t_n) = 0$ in L. 

3. Let $t_1, \dots, t_n$ be independent transcendental elements over K. The 
symmetric group $S_n$ can be made to act as a group of K-automorphisms of 
$K(t_1, \dots, t_n)$ by defining $\theta(t_i) = t_{\theta(i)}$ for all $\theta \in S_n$. Distinct elements give 
rise to distinct K-automorphisms. 

As we would expect, the fixed field of $S_n$ consists of the quotients of the 
symmetric polynomials in the $t_i$. In particular it contains the elementary symmetric 
polynomials $\sigma_k$, and in fact if F is the fixed field of $S_n$ we have $F = K(\sigma_1, \dots, \sigma_n)$. 

Lemma 21 [ST, p143] 

With the above notation, $\sigma_1, \dots, \sigma_n$ are independent transcendental 
elements over K. 

Now we can consider the $t_i$ to be the n roots of the general polynomial $g$ of 
degree n over $K(\sigma_1, \dots, \sigma_n)$, where 

$g(x) = x^n - \sigma_1 x^{n-1} + \dots + (-1)^n \sigma_n.$ 

Thus we have 

Theorem 22 [ST, p143] 

For any field K let $g$ be the general polynomial of degree n over K and let 
$\Sigma$ be a splitting field for $g$ over $K(\sigma_1, \dots, \sigma_n)$. Then the roots $t_1, \dots, t_n$ of $g$ 
in $\Sigma$ are independent transcendental elements over K, and the Galois group of 
$\Sigma : K(\sigma_1, \dots, \sigma_n)$ is the full symmetric group $S_n$. 

Corollary 23 

The general polynomial of degree n has Galois group which is not soluble 
for $n \geq 5$. 
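
The link between the roots $t_i$ and the coefficients $\sigma_k$ of the general polynomial can be illustrated numerically. The sketch below is only an illustration: it substitutes integers for the $t_i$ (so they are no longer independent transcendentals), and the function names are our own. It expands $\prod (x - t_i)$, compares the coefficients with $(-1)^k \sigma_k$, and confirms that each $\sigma_k$ is unchanged when the $t_i$ are permuted, which is why the $\sigma_k$ lie in the fixed field of $S_n$.

```python
from itertools import combinations, permutations
from math import prod


def elementary_symmetric(ts, k):
    """Sum of all products of k distinct entries of ts (equal to 1 for k = 0)."""
    return sum(prod(c) for c in combinations(ts, k))


def expand_monic(ts):
    """Coefficients of prod(x - t for t in ts), highest degree first."""
    coeffs = [1]
    for t in ts:
        # Multiply the current polynomial by (x - t):
        # x * p(x) shifts the coefficients up, t * p(x) keeps their degree.
        coeffs = [a - t * b for a, b in zip(coeffs + [0], [0] + coeffs)]
    return coeffs


ROOTS = (2, 3, 5, 7)  # sample values standing in for t_1, ..., t_4

if __name__ == "__main__":
    print(expand_monic(ROOTS))
    print([elementary_symmetric(ROOTS, k) for k in range(len(ROOTS) + 1)])
```

With these sample roots the expanded coefficients alternate in sign and agree in absolute value with $\sigma_0, \dots, \sigma_4$.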



A.3 Soluble Groups 

A.3.1 Normal Subgroup Series 

In Section A.1 we showed that a soluble group has a series of normal subgroups 
of index p (for p prime) and this series terminates with the identity element. 
A more general definition is available which does not confine solubility to the 
action of adjoining pth roots. 

Definition 1 [ST, pp114-115] 

A group B is soluble if it has a finite series of subgroups 

$B = B_0 \supseteq B_1 \supseteq \dots \supseteq B_n = e$ 

called a composition series, such that 

1. $B_{i+1} \triangleleft B_i$ for $i = 0, \dots, n - 1$; 

2. $B_i/B_{i+1}$ is abelian for $i = 0, \dots, n - 1$. 

Subgroup normality is not transitive so $B_i \triangleleft B_{i-1} \triangleleft B_{i-2}$ does not 
imply $B_i \triangleleft B_{i-2}$. 

Note that if we compare this to Galois’ definition of a soluble group then 
$|B_i/B_{i+1}| = p$ for a prime number p. Therefore $B_i/B_{i+1}$ is abelian since all 
groups of prime order are cyclic and hence abelian. 

We know from the examples in Section A.1.2 that the decomposition 
of a group into subgroups is not necessarily unique. If we have more than one 
way of composing a series of normal subgroups, we need to be sure that each 
different series satisfies the definition of a soluble group. The Jordan-Holder 
Theorem gives 

Theorem 2 [AL, p263] 

Let B be a finite group with composition series 

$B = B_0 \supseteq B_1 \supseteq \dots \supseteq B_n = e$ 
and $B = C_0 \supseteq C_1 \supseteq \dots \supseteq C_m = e$. 

Then m = n and the n quotient groups $B_i/B_{i+1}$ can be put in 1-1 
correspondence with the n quotient groups $C_j/C_{j+1}$ in such a manner that the 
corresponding groups are isomorphic. We will not necessarily have $B_i/B_{i+1} \cong C_i/C_{i+1}$ 
but all the quotient groups can be paired off. Provided we can construct one 
series that satisfies Definition 1 then every series will also satisfy the definition. 



A.3.2 Normal Subgroups 

As we have seen in Section A.3.1 the solubility of a group depends on the 
normality of the subgroups. Abelian groups are useful since every subgroup 
of an abelian group is normal, and every quotient of an abelian group is 
abelian. This means that every abelian group is soluble and therefore every 
cyclic group is soluble since they are all abelian. 

In Section A.1 we mentioned that one of Galois’ examples of a soluble 
equation was the cyclotomic equation 

$\frac{x^p - 1}{x - 1} = x^{p-1} + \dots + x + 1$ 

where p is prime. It is clear that one of the roots of $x^p - 1$ is 1 so we are 
interested only in the roots of $x^{p-1} + \dots + x + 1$. This is an irreducible equation 
with simple roots. Note that if $\alpha$ is one of these roots then all the roots can be 
expressed as $\alpha^i$ for $i = 1, \dots, p - 1$. So the splitting field for this equation is 
$Q(\alpha)$ and the Q-automorphisms are 

$\phi_1 : \alpha \to \alpha$ 
$\phi_2 : \alpha \to \alpha^2$ 
$\phi_3 : \alpha \to \alpha^3$ 
$\vdots$ 
$\phi_{p-1} : \alpha \to \alpha^{p-1}$ 

The Galois group is abelian since for any two Q-automorphisms we have 

$\phi_i\phi_j(\alpha) = \phi_i(\alpha^j) = \alpha^{ij} = \alpha^{ji} = \phi_j(\alpha^i) = \phi_j\phi_i(\alpha)$ 

so the Galois group is soluble, $Q(\alpha)$ is a radical extension and all pth roots of 
unity are expressible in radicals when p is prime. Since we have already shown 
that any nth root, where n is not prime, can be constructed by using pth roots 
when p is a prime number, this shows that the cyclotomic equation $x^n - 1$ is 
soluble by radicals for any n. The following theorem is useful when we have 
partial knowledge of the structure of a group. 



Theorem 3 [ST, p116] 

Let G be a group, H a subgroup of G, and N a normal subgroup of G. 

1. If G is soluble then H is soluble; 

2. If G is soluble then G/N is soluble; 

3. If N and G/N are soluble then G is soluble. 

Every subgroup H of a group G, where H has index 2 in G, is a normal subgroup. 
A particular family of such subgroups is the alternating group $A_n$ which has 
index 2 in the symmetric group $S_n$. So $A_n$ is always a normal subgroup of $S_n$. 
However, for $n \geq 5$ this information, as we might expect, is not much help. 
This leads us on to our next section. 

A.3.3 Simple Groups 

Theorem 4 [ST, pp117-118] 

1. A group G is simple if its only normal subgroups are e and G. 

2. A soluble group is simple if and only if it is cyclic of prime order. 

3. If $n \geq 5$ then the alternating group $A_n$ of degree n is simple. 

The importance of this last result is highlighted by 

Corollary 5 [ST, p119] 

The symmetric group $S_n$ of degree n is not soluble if $n \geq 5$. 

Proof 

If $S_n$ were soluble then $A_n$ would be soluble. But if $n \geq 5$, $A_n$ is simple 
and of order n!/2, which is not prime for $n \geq 5$, and therefore not soluble. This 
shows that for $n \geq 5$, $S_n$ is not soluble. 

□ 
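
For small n the corollary can be checked by brute force. The sketch below does not follow the proof above; it uses the derived series, a standard equivalent test that we are substituting here: a finite group is soluble exactly when repeatedly taking the subgroup generated by all commutators $aba^{-1}b^{-1}$ eventually reaches the identity. Permutations are stored as tuples and all function names are our own.

```python
from itertools import permutations


def compose(p, q):
    """Return the permutation p∘q (apply q first), both given as tuples."""
    return tuple(p[i] for i in q)


def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def closure(generators, identity):
    """Subgroup generated by the given elements of a finite group."""
    group = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for h in generators:
            gh = compose(g, h)
            if gh not in group:
                group.add(gh)
                frontier.append(gh)
    return group


def derived_subgroup(group, identity):
    """Subgroup generated by all commutators a b a^-1 b^-1."""
    commutators = {
        compose(compose(a, b), compose(inverse(a), inverse(b)))
        for a in group
        for b in group
    }
    return closure(commutators, identity)


def derived_series_orders(n):
    """Orders of S_n, its derived subgroup, and so on until the series stops shrinking."""
    identity = tuple(range(n))
    group = set(permutations(range(n)))
    orders = [len(group)]
    while True:
        nxt = derived_subgroup(group, identity)
        if len(nxt) == len(group):
            return orders
        group = nxt
        orders.append(len(group))


if __name__ == "__main__":
    for n in (3, 4, 5):
        print(n, derived_series_orders(n))
```

For $S_3$ and $S_4$ the orders fall to 1, whereas for $S_5$ the series stalls at order 60, the order of $A_5$, so it never reaches the identity.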



A.3.4 p-Groups 

These are another example of soluble groups. We have 

Lemma 6 [ST, p121]: 

1. Let p be a prime. A finite group B is a p-group if its order is a power of p. 

2. If B is a finite p-group of order $p^n$ then B has a series of normal subgroups 

$B = B_0 \supseteq B_1 \supseteq \dots \supseteq B_n = e$ 

such that $|B_i| = p^{n-i}$ for all $i = 0, \dots, n$. 

3. Every finite p-group is soluble. 

Proof (of (3)) 

Every quotient group has order 

$|B_i|/|B_{i+1}| = p^{n-i}/p^{n-(i+1)} = p$ 

which is prime. Therefore every quotient group is cyclic and abelian. As every 
subgroup in the series is normal, this implies B is soluble. 

□ 

The theory of p-groups was extended by Sylow who gave the following theorem. 

Theorem 7 [ST, p123] 

Let B be a finite group of order $p^a r$ where p is prime and does not divide 
r. Then 

1. B possesses at least one subgroup of order $p^a$; 

2. all such subgroups are conjugate in B; 

3. any p-subgroup of B is contained in one of order $p^a$; 

4. the number of subgroups of B of order $p^a$ leaves remainder 1 on division 
by p. 

If the Sylow p-subgroup P of B is a normal subgroup of B, and B/P is abelian, 
then B is soluble, since P is soluble. 

$S_5$ is a group of order $120 = 2^3 \cdot 3 \cdot 5$. So $S_5$ should have Sylow p-subgroups 
for p = 2, p = 3 and p = 5. Let $P_2$ be a Sylow 2-subgroup of $S_5$; then $|P_2| = 8$. 
Using similar notation we also have $|P_3| = 3$ and $|P_5| = 5$. 

Allenby gives a more detailed version of part 4 in Theorem 7: 

4′. The number of conjugates of a Sylow p-subgroup of B divides $|B|/p^a$ and 
is congruent to 1 (mod p). 

Consider $S_3$ which has order $6 = 2 \cdot 3$. $S_3$ has one Sylow 3-subgroup which is 
$A_3$. 

$|S_3|/3 = 2$ and $1 \mid 2$, $1 \equiv 1 \pmod 3$. 

$S_3$ has a Sylow 2-subgroup $X = \{e, (12)\}$, whose conjugates are $\{e, (13)\}$ and 
$\{e, (23)\}$, giving three Sylow 2-subgroups in all. 

$|S_3|/2 = 3$ and $3 \mid 3$, $3 \equiv 1 \pmod 2$. 

$A_3$ is normal in $S_3$, but X is not, since it has three distinct conjugates. 

The example from Section A.2 of $Z = \Gamma(Q(\sqrt{2}, \sqrt{3}, \sqrt{5}) : Q)$ is an example 
of a p-group since $|Z| = 8 = 2^3$. So, as we expected, Z is a soluble group. 
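
The Sylow counts quoted for $S_3$ are small enough to verify by enumeration. The sketch below is an illustration only; it relies on the fact that the Sylow subgroups of $S_3$ have prime order and are therefore cyclic, so they can all be found as subgroups generated by a single element. Permutations are tuples acting on {0, 1, 2}, and the helper names are our own.

```python
from itertools import permutations


def compose(p, q):
    """Return the permutation p∘q (apply q first)."""
    return tuple(p[i] for i in q)


def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def cyclic_subgroup(g):
    """The subgroup generated by the single permutation g."""
    identity = tuple(range(len(g)))
    elements, x = [identity], g
    while x != identity:
        elements.append(x)
        x = compose(x, g)
    return frozenset(elements)


def conjugates(subgroup, group):
    """All subgroups of the form g H g^-1 for g in the group."""
    return {
        frozenset(compose(compose(g, h), inverse(g)) for h in subgroup)
        for g in group
    }


S3 = list(permutations(range(3)))


def sylow_subgroups(p):
    """Subgroups of S3 of order p, for p = 2 or 3 (both are cyclic)."""
    return {H for H in map(cyclic_subgroup, S3) if len(H) == p}


if __name__ == "__main__":
    for p in (2, 3):
        subgroups = sylow_subgroups(p)
        normal = [H for H in subgroups if conjugates(H, S3) == {H}]
        print(p, "count", len(subgroups), "normal ones", len(normal))
```

The enumeration finds three Sylow 2-subgroups, all conjugate to one another, and a single Sylow 3-subgroup, which is therefore normal; both counts are congruent to 1 modulo p and divide $|S_3|/p$.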



A.4 Geometrical Constructions 

In this section we study an interesting application of field theory to classical 
geometry. Since ancient Greek times mathematicians have been interested in 
the construction of geometrical figures using only a straight edge and compass. 
With this restriction we are allowed two operations: 

1. drawing a line through two given points; 

2. drawing a circle with centre at one point and passing through a second 
given point. 

Many figures have proved to be constructible under these restrictions but not 
all. Three problems were of particular interest to the ancient Greeks and 
subsequent generations of mathematicians over the following centuries. These 
problems are [DU, p264]: 

1. The Duplication of the Cube: given a cube construct another cube with 
twice the volume of the given cube. 

2. The Trisection of an Arbitrary Angle: some angles can be trisected but a 
method is required for the trisection of any angle. 

3. The Quadrature of the Circle: given a circle construct the square that has 
the same area as the given circle. 

Not until the 19th century was it proved that these three problems have no 
solution. The “impossibility proofs” depend on algebraic techniques rather than 
geometry. We shall use field theory, particularly that of field extensions, to prove 
the impossibility of these three constructions. 



A.4.1 Constructible Points 

We need to find a way of expressing geometrical construction in algebraic terms. 
We can assume that we are given at least two points. Let the set of given 
points be $P_0 \subseteq R^2$ and let $p_1, p_2 \in P_0$. Then we can use our two operations to 
construct: 

1. a line through $p_1$ and $p_2$; 

2. a circle with centre $p_1$ and radius $p_1 p_2$; 

3. a circle with centre $p_2$ and radius $p_1 p_2$. 

We add points to $P_0$ by adjoining the intersection points of such lines and 
circles. Let $r_1$ be one of these intersection points. Then $r_1$ is constructible in 
one step from $P_0$. We define a point $r$ in $R^2$ to be constructible from $P_0$ 
if there is a finite sequence $r_1, r_2, \dots, r_n = r$ of points in $R^2$ such that for 
each $i = 1, 2, \dots, n$ the point $r_i$ is constructible in one step from the set 
$P_0 \cup \{r_1, \dots, r_{i-1}\}$ [ST, p52]. 

Let $K_0$ be the subfield of R generated by the x and y coordinates of the 
points in $P_0$. Let $r_i$ be an intersection point with coordinates $(x_i, y_i)$. Then 
$K_i$ is the subfield of R generated by adjoining $x_i$ and $y_i$ to $K_{i-1}$, i.e. 
$K_i = K_{i-1}(x_i, y_i)$. So 

$K_0 \subseteq K_1 \subseteq \dots \subseteq K_n \subseteq R.$ 

With this notation we have 

Lemma 1 [ST, p53] 

$x_i$ and $y_i$ are roots in $K_i$ of quadratic polynomials over $K_{i-1}$. 

Outline proof 

The point $r_i$ is an intersection point of either two lines, two circles, or a line 
and a circle, all constructed from points of $P_0 \cup \{r_1, \dots, r_{i-1}\}$. If $r_i$ is the 
intersection point of 

1. Two lines then $x_i, y_i \in K_{i-1}$ which implies $x_i, y_i \in K_i$. Also $x_i$ and $y_i$ are 
certainly roots of quadratic polynomials over $K_{i-1}$; 

2. Two circles then we obtain quadratic expressions for $x_i$ and $y_i$ with 
coefficients in $K_{i-1}$; 

3. A circle and a line then again we have quadratic expressions for $x_i$ and $y_i$ 
with the same result as in (2) above. 

The following theorem is the method by which we will prove the impossibility 
of the three constructions mentioned at the start of the section. 

Theorem 2 [ST, pp54-55] 

If $r = (x, y)$ is constructible from a subset $P_0$ of $R^2$ and if $K_0$ is the 
subfield of R generated by the coordinates of the points of $P_0$, then the degrees 
$[K_0(x) : K_0]$ and $[K_0(y) : K_0]$ are powers of 2. 

Proof 

If $r_i$ is the intersection of two lines then $x_i, y_i \in K_{i-1}$ and 

$[K_{i-1}(x_i) : K_{i-1}] = [K_{i-1}(y_i) : K_{i-1}] = 1.$ 

If $r_i$ is the intersection of two circles or of a circle and a line then $x_i$ and $y_i$ 
are the roots of quadratic polynomials over $K_{i-1}$. If the quadratic polynomials 
are reducible then the result is the same as for the intersection of two lines. If 
the quadratic polynomials for $x_i$ and $y_i$ are irreducible then 

$[K_{i-1}(x_i) : K_{i-1}] = [K_{i-1}(y_i) : K_{i-1}] = 2.$ 

So we can write 

$[K_{i-1}(x_i, y_i) : K_{i-1}] = [K_{i-1}(x_i, y_i) : K_{i-1}(x_i)][K_{i-1}(x_i) : K_{i-1}] = 1, 2$ or $4.$ 

Since each $[K_i : K_{i-1}]$ is a power of 2, by induction $[K_n : K_0]$ is a power of 2. 
But 

$[K_n : K_0(x)][K_0(x) : K_0] = [K_n : K_0]$ 

so $[K_0(x) : K_0]$ is also a power of 2. Similarly, $[K_0(y) : K_0]$ is a power of 2 
[ST, p55]. 

□ 



A. 4. 2 Impossibility Proofs 

We can assume that our given points Pq include a pair of perpendicular axes 
and the unit circle so that {(0, 0), (1, 0)} G Pq- The following three proofs show 
that it is not possible to construct these figures using only straight edge and 
compass. 

1 . Duplication of the Cube Without loss of generality, we can assume that 
the given cube is the unit cube and that one side is the segment of the x-axis 
(0,0) to (1,0). Hence we wish to construct the cube with volume 2 units and 
with length of side a such that = 2. If we could construct the point {a, 0) 
then [Q(q;) : Q] would be a power of 2. But a has a minimum polynomial x^ — 2 
over Q which is irreducible over Q. So [Q(a) : Q] = 3 and therefore (a, 0) is 
not constructive [ST, pp55-56]. 

2. Trisection of an Arbitrary Angle We will show that the angle tt/3 can- 
not be trisected. We wish to construct the angle 7t/9 which is equivalent to 
constructing the point (a, 0) where a = cos(7t/9). Recall that for any angle 
A, cos 4 = 4cos^(t1/3) — 3cos(4/3), so if A is tt/3 this gives 

4cos^(7t/9) — 3cos(7t/9) = 1/2 

8a^ — 6a — 1 = 0. 

Now 8x^ — 6x — 1 is irreducible over Q therefore Q(a) : Q = 3. This contra- 
diction shows that (a, 0) cannot be constructed ([DU, p265] and [ST, p56[). 

3. Quadrature of the Circle Without loss of generality we can assume that
the given circle is the unit circle. So we wish to construct the square with area
$\pi$. This is equivalent to constructing the point $(\sqrt{\pi}, 0)$. If we can construct
$(\sqrt{\pi}, 0)$ then we can certainly construct $(\pi, 0)$ (constructing the square of a
number is demonstrated in Section A.1.3) so that $[\mathbb{Q}(\pi) : \mathbb{Q}] = 2^m$ for some
$m \geq 0$. But $\pi$ is not algebraic over $\mathbb{Q}$ (by Lindemann's Theorem) and therefore
$[\mathbb{Q}(\pi) : \mathbb{Q}] \neq 2^m$, which shows that we cannot construct $(\pi, 0)$ and therefore by
implication $(\sqrt{\pi}, 0)$ [ST, p57].
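
The two irreducibility claims used in the first two proofs can be checked with the rational root theorem: a cubic with integer coefficients that has no rational root cannot factorise over $\mathbb{Q}$. The following is a minimal sketch and not part of the original project; the function names are hypothetical, and it assumes integer coefficients with non-zero leading and constant terms.

```python
from fractions import Fraction


def divisors(n):
    """Positive divisors of a non-zero integer n."""
    n = abs(n)
    return [d for d in range(1, n + 1) if n % d == 0]


def rational_roots(coeffs):
    """Rational roots of an integer polynomial (hypothetical helper).

    coeffs lists the coefficients from the highest degree down; the leading
    and constant coefficients are assumed to be non-zero.
    """
    lead, const = coeffs[0], coeffs[-1]
    # Rational root theorem: any root p/q has p | const and q | lead.
    candidates = {
        Fraction(sign * p, q)
        for p in divisors(const)
        for q in divisors(lead)
        for sign in (1, -1)
    }

    def value(x):
        acc = Fraction(0)
        for c in coeffs:  # Horner's rule, exact arithmetic
            acc = acc * x + c
        return acc

    return sorted(r for r in candidates if value(r) == 0)


# x^3 - 2 (duplication of the cube) and 8x^3 - 6x - 1 (trisection of pi/3)
cube_roots = rational_roots([1, 0, 0, -2])
trisect_roots = rational_roots([8, 0, -6, -1])
print(cube_roots, trisect_roots)  # both lists are empty
```

Since neither cubic has a rational root, neither has a linear factor over $\mathbb{Q}$, and a reducible cubic must have one.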
A.4.3 Performing Algebraic Operations by Construction

We can now give demonstrations of constructions where we are carrying out
an algebraic operation using the method of geometrical construction.

1 Constructing the square of a number. Given the point $a$ we construct $a^2$ by
drawing the line from $(0, a)$ to $(1, 0)$ then constructing the line parallel to this
line and passing through $(a, 0)$. By similar triangles the point $(0, b)$ at which
the parallel line intersects the vertical axis is $(0, a^2)$ since

$$\frac{a}{1} = \frac{b}{a}.$$

2 Constructing the square root of a number. Given the point $b$ we construct
$\sqrt{b}$ by drawing the circle which has diameter $(-1, 0)$ to $(b, 0)$. This circle will
intersect the vertical axis at $(0, a)$. By the intersecting chords theorem $a^2 = 1 \times b$
so that $a = \sqrt{b}$.
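
Both constructions can be checked in coordinates. The sketch below is an illustration only (the function names are hypothetical): it follows the parallel line through $(a, 0)$ back to the vertical axis, and intersects the circle on diameter $(-1, 0)$ to $(b, 0)$ with the vertical axis.

```python
import math


def square_by_parallel_line(a):
    """Height at which the line through (a, 0), parallel to the line
    from (0, a) to (1, 0), meets the vertical axis."""
    slope = (0 - a) / (1 - 0)      # slope of the line from (0, a) to (1, 0)
    return slope * (0 - a)         # y = slope * (x - a) evaluated at x = 0


def sqrt_by_circle(b):
    """Height at which the circle on diameter (-1, 0) to (b, 0)
    meets the positive vertical axis."""
    centre_x = (b - 1) / 2
    radius = (b + 1) / 2
    # On the axis x = 0 we have y^2 = radius^2 - centre_x^2.
    return math.sqrt(radius ** 2 - centre_x ** 2)


print(square_by_parallel_line(3.0))   # height of the intercept for a = 3
print(sqrt_by_circle(2.0))            # height of the intercept for b = 2
```

Algebraically, `radius ** 2 - centre_x ** 2` simplifies to $b$, which is the intersecting chords theorem in coordinate form.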



A.4.4 Regular n-gons

In 1796, at the age of 18, Gauss found a construction by straight edge and
compass for the regular 17-gon. This was the first new such polygon for 2,000
years. Prior to this it was known how to construct regular $n$-gons for only the
following values of $n$ [AL, p174]:

$$2^r, \quad 2^r \cdot 3, \quad 2^r \cdot 5, \quad 2^r \cdot 15.$$

In 1801 Gauss gave the following theorem: 

Theorem [ST, p169]

The regular $n$-gon is constructible by straight edge and compass if and only
if $n = 2^r p_1 \dots p_s$ where $r$ and $s$ are non-negative integers and $p_1, \dots, p_s$ are
distinct odd primes of the form $p_i = 2^{2^{r_i}} + 1$ for non-negative integers $r_i$
(proof: [ST, pp169-170]).

Numbers of this form are known as Fermat numbers since they were discovered by
Fermat in the 17th century. If $F_n = 2^{2^n} + 1$ then $F_0 = 3$, $F_1 = 5$, $F_2 = 17$,
$F_3 = 257$, $F_4 = 65{,}537$. These are all primes and Fermat conjectured that $F_n$ is
prime for all $n$. This was disproved in 1732 by Euler when he showed that $F_5$
is not a prime. This leads us to

Proposition [ST, p170]

The only primes $p$, up to the bound stated in [ST, p170], for which the regular
$p$-gon is constructible are 2, 3, 5, 17, 257, 65537.
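
The statements about the first few Fermat numbers are easy to verify directly. The sketch below is an illustration rather than part of the project; it uses plain trial division, which is adequate for numbers of this size, and the factor 641 of $F_5$ is the one found by Euler.

```python
def fermat_number(n):
    """F_n = 2^(2^n) + 1."""
    return 2 ** (2 ** n) + 1


def is_prime(m):
    """Primality by trial division (fine for the small cases used here)."""
    if m < 2:
        return False
    if m % 2 == 0:
        return m == 2
    d = 3
    while d * d <= m:
        if m % d == 0:
            return False
        d += 2
    return True


first_five = [fermat_number(n) for n in range(5)]
print(first_five)                      # [3, 5, 17, 257, 65537]
print(all(is_prime(f) for f in first_five))
print(fermat_number(5) % 641)          # 0, so F_5 is composite
```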
Figure A.1 Constructing the square of a number

If we wish to construct a $p$-gon where $p$ is prime we can make use of the
cyclotomic equation

$$\frac{x^p - 1}{x - 1} = x^{p-1} + \dots + x + 1.$$

The roots of the right hand side of the equation are $a_k$, $k = 1, \dots, p-1$, where
$a_k = \cos(2k\pi/p) + i\sin(2k\pi/p)$.

As an example we will find a construction for the regular pentagon. If we
assume that the pentagon has centre $(0,0)$ and its first vertex is at $(1,0)$ then
we wish to construct the angle $\theta = 2\pi/5$ which is equivalent to constructing
the point $(\alpha, 0)$ where $\alpha = \cos\theta$.

Figure A.2 Constructing the square root of a number

The primitive roots of $x^5 - 1$ are $a_1, a_2, a_3, a_4$ where $a_1 + a_2 + a_3 + a_4 = -1$
(an elementary symmetric polynomial). We also need

$$\begin{aligned}
a_{5-k} + a_k &= \cos(5-k)\theta + \cos k\theta + i\sin(5-k)\theta + i\sin k\theta \\
&= 2\cos(\pi)\cos\frac{(5-2k)\pi}{5} + 0 \\
&= 2\cos(2k\pi/5) \\
&= 2\cos k\theta.
\end{aligned}$$

So $a_1 + a_4 = 2\cos\theta$ and $a_2 + a_3 = 2\cos 2\theta$.
Putting these values into the elementary symmetric polynomial we have

$$2\cos\theta + 2\cos 2\theta = -1 \;\Rightarrow\; 4\cos^2\theta + 2\cos\theta - 1 = 0 \;\Rightarrow\; 4\alpha^2 + 2\alpha - 1 = 0.$$

Taking the positive root of this quadratic we have

$$\alpha = \frac{\sqrt{5} - 1}{4}.$$

In the demonstrations in Section A.1.2 we showed how to construct the square
root of a number so, with the construction of rational numbers and one square
root, we can construct the angle $\theta = 2\pi/5$ to give the second vertex of the
pentagon.
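
The pentagon calculation can be confirmed numerically. The following sketch is an illustration only (the function names are hypothetical): it sums the four primitive fifth roots of unity and compares the positive root of $4\alpha^2 + 2\alpha - 1 = 0$ with $\cos(2\pi/5)$.

```python
import cmath
import math

THETA = 2 * math.pi / 5


def sum_of_primitive_fifth_roots():
    """a_1 + a_2 + a_3 + a_4 with a_k = cos(k*theta) + i sin(k*theta)."""
    return sum(cmath.exp(1j * k * THETA) for k in range(1, 5))


def positive_root():
    """Positive root of 4*alpha^2 + 2*alpha - 1 = 0."""
    return (math.sqrt(5) - 1) / 4


alpha = positive_root()
print(sum_of_primitive_fifth_roots())  # close to -1
print(alpha, math.cos(THETA))          # the two values agree
```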

Gauss gave the following algebraic expression for constructing the 17-gon
[ED, p32]:

$$\cos\frac{2\pi}{17} = -\frac{1}{16} + \frac{1}{16}\sqrt{17} + \frac{1}{16}\sqrt{34 - 2\sqrt{17}} + \frac{1}{8}\sqrt{17 + 3\sqrt{17} - \sqrt{34 - 2\sqrt{17}} - 2\sqrt{34 + 2\sqrt{17}}}.$$

This can be derived from the cyclotomic equation of degree 17 by forming
equations of the roots similar to the one we used for the pentagon above, although
more complicated [ST, pp171-173]. Research into constructions of $n$-gons has
continued and methods have been found for the 257-gon. There has been some
research into the 65,537-gon, with Richelot publishing a paper as early as 1832,
although Stewart is not clear on whether this resulted in a satisfactory
construction [ST, p170]. It is difficult to imagine the complexity of solving
trigonometric equations in the 65,536 primitive roots of the cyclotomic equation,
particularly without the advanced computer packages now available to us.
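
As a small illustration of what such packages make routine, the nested square root expression usually quoted for $\cos(2\pi/17)$ can be compared with a direct evaluation. This sketch is not part of the original project; the formula coded below is the commonly cited closed form, and the function name is hypothetical.

```python
import math


def gauss_cos_17():
    """Evaluate the commonly quoted nested-radical form of cos(2*pi/17)."""
    r17 = math.sqrt(17)
    s_minus = math.sqrt(34 - 2 * r17)
    s_plus = math.sqrt(34 + 2 * r17)
    inner = 17 + 3 * r17 - s_minus - 2 * s_plus
    return (-1 + r17 + s_minus) / 16 + math.sqrt(inner) / 8


print(gauss_cos_17())
print(math.cos(2 * math.pi / 17))
```

Only rational operations and square roots appear, which is exactly what straight edge and compass constructions can realise.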
Epilogue

The assessors for this project were unanimous in awarding this student a first.
The overall standard of the mathematics was very high, and the student
obviously understood the sometimes quite difficult material. This was clear from
both the oral presentation and the viva voce. The style of the presentation
is mature and the student's judgement on what to include and what to
leave as a reference is nigh perfect. The layout is good (although this is less
obvious here as it has had to change to fit the LaTeX style of this text) and
leaves the reader in no doubt that the student had read and understood the books
referred to.

The only criticisms were the odd technical lapse and typographical error
(many have been left in; maybe you can spot them).

The student took responsibility for reading and understanding Galois
Theory. This is a commitment and involves being willing to devote a great
deal of time to reading, re-reading and understanding books that can be quite
advanced. It also meant doing many examples. Much time therefore needs to
be put aside, which is consistent with the weight the project carries (one sixth of
the final year in this instance). The self-confidence of the student is also obvious
from the writing, and this indicates a high level of mathematical ability.




B 

Project Example 2: Algebraic Curves 



Preamble 

This project takes further a topic with which mathematics students are
familiar, namely graphs of functions. As in Appendix A, the subject is well embedded
in the literature, but here there are fewer technicalities. There are places where the
student has to branch out into the unknown (for the student), but much of the
project is putting what is reasonably familiar into more rigorous mathematical
terms.



Abstract 

This project is an investigation into the theory of algebraic curves. We start
by considering the basic definition and characteristics of an algebraic curve.
We consider how an algebraic curve lies in a projective space and this leads
on to studying the intersection of two curves. The properties of some simple
curves and how they intersect are considered in more detail, including the group
structure of elliptic curves. We also investigate some results from complex
analysis and subsequently use them to define Riemann surfaces and explain their
connection with algebraic curves. This leads to more advanced theories from
complex function theory such as integration on a curve. The topology of
algebraic curves as subsets of $\mathbb{CP}_2$ is also considered.
Introduction

Algebraic curves have a history of over 2000 years and it is inevitable that the 
Greeks made the first developments and discoveries. The Greeks had a limited 
algebraic knowledge but good geometric methods and to Euclid a circle was not 
defined by an equation but by the locus of all points having an equal distance 
from a fixed point. Greek mathematics was virtually ignored in Western Europe 
until it was rediscovered in the Renaissance period. The Arabs were responsible 
for reintroducing Greek mathematics into Western society and also brought 
with them a sophisticated algebraic knowledge and a good algebraic notation. 

In the late seventeenth century a more algebraic approach was being applied
to algebraic curves and a large number of mathematicians published papers on
the subject. These included Newton who made a study of cubic curves and
identified and classified 72 different cases. By the nineteenth century it was
apparent that considering curves as subsets of $\mathbb{C}^2$ was easier than looking
at curves as subsets of $\mathbb{R}^2$, as a curve in the Euclidean plane can degenerate so
much that it no longer resembles a curve.

In the nineteenth century mathematicians such as Riemann were interested 
in more abstract geometrical spaces and this led to his theory of Riemann sur- 
faces and their connection with algebraic curves. Research into algebraic curves 
continues up to the present day particularly in the area of particle physics. Here 
the ‘lifetime’ of a particle is represented by a ‘string’ in four dimensional space- 
time. This can consequently be represented by a Riemann surface. 

The theory of algebraic curves is one of the most varied and beautiful subjects in
mathematics. This project does not have a specific aim and the discussion is relatively
informal, but it does give an insight into some of the more interesting and
appealing results that have arisen. As one can imagine, for a subject that has
existed for such a long time, the theory of algebraic curves is vast and involves
a lot of interplay between different areas of mathematics such as algebra,
topology, complex analysis and geometry. It would therefore be foolish to try to
incorporate every idea in the subject: consequently a lot of very elegant and
compelling theories have been left out; these include the Riemann-Roch
theorem and an in-depth investigation into singularities.

In Section B.1 we show that an algebraic curve is defined by a polynomial
and look at why we consider curves in $\mathbb{C}^2$. We also define singularities and
consider the multiplicity of such points. In Section B.2 we investigate how two
curves intersect by considering one of the most famous theorems in the study
of algebraic curves: Bezout's theorem. Section B.3 applies results from previous
sections to some simple curves, namely conics and cubics. We shall see that a
cubic curve gives rise to some amazing properties such as the additive group
structure. Section B.4 is a résumé of results from complex analysis that we need
to be familiar with for the final sections.

In Section B.5 we investigate the topology of curves in $\mathbb{CP}_2$, which is a space
that has four real dimensions. We find that these objects are topological surfaces
and we consequently introduce the idea of an abstract Riemann surface to
represent an algebraic curve. Section B.6 is a less rigid investigation into more
advanced ideas involving algebraic curves and complex analysis, which
leads on to topics from complex function theory.

Throughout the course of the project three books were heavily relied upon
for the theorems and subsequent proofs. These were Frances Kirwan, Complex
Algebraic Curves; Bak and Newman, Complex Analysis; and R. J. Walker,
Algebraic Curves. Bak and Newman was used particularly in Section B.4 and Kirwan
in Sections B.5 and B.6.



B.1 Basic Definitions and Properties



In this section we will look at some basic properties of algebraic curves. We also
investigate the concept of a projective space and why it is useful to consider
curves in the projective plane as well as curves in $\mathbb{C}^2$.



B.1.1 Complex Algebraic Curves and Real Algebraic 
Curves 

A complex algebraic curve $C$ in $\mathbb{C}^2$ is defined by

$$C = \{(x, y) \in \mathbb{C}^2 : P(x, y) = 0\}$$

where $P(x, y)$ is a polynomial in two variables with complex coefficients. Real
algebraic curves are subsets of $\mathbb{R}^2$, namely

$$C = \{(x, y) \in \mathbb{R}^2 : P(x, y) = 0\}$$

where $P(x, y)$ has real coefficients.

For every real algebraic curve there is an associated complex algebraic curve
defined by the same polynomial. Investigating curves in the complex space is far
simpler than only considering curves in the Euclidean space $\mathbb{R}^2$. If we work with
complex numbers every polynomial in one variable factorises completely over
$\mathbb{C}$ (not the case over $\mathbb{R}$). This result from complex analysis, known as the
fundamental theorem of algebra (see B.4.31), allows us to completely factorise
any such polynomial, so for simplicity in the remainder of the project we will
usually only consider curves as subsets of $\mathbb{C}^2$.

The degree of a curve $C$ is given by the degree of the polynomial that defines
$C$. Hence the curve defined by $P(x, y) = x(x^2 + 1)$ has degree 3.

The polynomial $P(x_1, \dots, x_n)$ is homogeneous of degree $d$ if and only if

$$P(tx_1, \dots, tx_n) = t^d P(x_1, \dots, x_n)$$

for all $t, x_1, \dots, x_n$, i.e. $P$ is homogeneous if every one of its terms
is of the same degree $d$.

A polynomial is irreducible if it has no factors apart from constants and 
scalar multiples of itself. A curve defined by P{x,y) is irreducible if P{x,y) is 
irreducible. 

So far we have only considered curves in $\mathbb{C}^2$, but a curve in $\mathbb{C}^2$ is never
compact as it always has branches going to infinity. We can compactify a curve
in $\mathbb{C}^2$ by adding in the "points at infinity" and getting a projective space. It
can be beneficial to consider curves in a projective space. For example, consider
two parallel lines which lie in $\mathbb{C}^2$. They will of course never intersect, but if
we add a point at infinity to each line they meet at a distinct point (albeit at
"infinity" for this example). Consequently any two distinct lines in a projective
space will meet at a distinct point. For this reason it is useful to consider curves
in a projective space when studying their algebraic properties.



B.1.2 Projective Spaces 

B.1.2.1 Complex Projective Plane. Consider the set of all possible complex
lines which pass through the origin of $\mathbb{C}^3$. It can be shown that the set of
all such lines forms a complex surface (4 real dimensions). The surface that is
formed is known as the complex projective plane and denoted by $\mathbb{CP}_2$. This
idea can be generalised to give a projective space for any complex dimension
$n$.

The complex projective space $\mathbb{CP}_n$ of dimension $n$ is the set of complex
one dimensional subspaces of the vector space $\mathbb{C}^{n+1}$. When $n = 2$ we get the
complex projective plane $\mathbb{CP}_2$ as above.

To prove results algebraically we need a notation to specify points in $\mathbb{CP}_2$.
Clearly a line through the origin of $\mathbb{C}^3$ is uniquely determined if we choose any
point (save the origin) which lies on that line. So a point $p$ in $\mathbb{CP}_2$ consists of
the unique line in $\mathbb{C}^3$ which passes through $(0, 0, 0)$ and $(a, b, c)$ where $a$, $b$ and
$c$ are not all zero. The notation for such a point $p$ is given by $[a, b, c]$.

We can define the points in $\mathbb{CP}_2$ by

$$\mathbb{CP}_2 = \{[a, b, c] : (a, b, c) \in \mathbb{C}^3 - \{0\}\}$$

and also $[a, b, c] = [d, e, f]$ if and only if $a = \lambda d$, $b = \lambda e$ and $c = \lambda f$ for some
$\lambda \in \mathbb{C} - \{0\}$. So far we have used the theory of three dimensional complex space
to develop the idea of a projective space. However we want to use this space to
study objects which have two complex dimensions, specifically algebraic curves.
So we must find a way of associating figures in $\mathbb{CP}_2$ with figures in $\mathbb{C}^2$.

Suppose we have a projective figure which lies in $\mathbb{CP}_2$. We can place a
plane in $\mathbb{C}^3$, making sure it does not pass through the origin. The corresponding
figure in $\mathbb{C}^2$ consists of the points where the lines of the figure pierce the
plane. This is fine provided that each line of the projective figure pierces the
plane, but any point of $\mathbb{CP}_2$ which consists of a line through the origin parallel
to the plane does not pierce the plane. Such a point is called an ideal point for
that plane. In general a projective figure can be represented by the points which
pierce a plane in $\mathbb{C}^3$ together with some or all of the ideal points for that plane.
Using this idea we could have defined the complex projective plane as simply
the usual complex plane with "ideal points at infinity", one ideal point for each
complex line through the origin. In the following example we will show how
points in $\mathbb{C}^2$ can be identified with points in $\mathbb{CP}_2$.

Example 1.22 Consider the subset $S$ of points in $\mathbb{CP}_2$ which pierce the plane
$z = 1$:

$$S = \{[x, y, z] : z \neq 0\}.$$

Each point $(x, y) \in \mathbb{C}^2$ can be identified with the point $[x, y, 1] \in S$. Conversely,
$[x, y, z] = [x/z, y/z, 1] \in S$ can be identified with $(x/z, y/z) \in \mathbb{C}^2$. This gives a
1-1 correspondence between $\mathbb{C}^2$ and $S$. So $\mathbb{C}^2$ can be regarded as a subset of
$\mathbb{CP}_2$.
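
The identification in Example 1.22 can be sketched in code. This is an illustration under stated assumptions, not part of the project: coordinates are taken to be rational numbers standing in for complex ones so that the arithmetic is exact, and the function names are hypothetical.

```python
from fractions import Fraction


def same_projective_point(p, q):
    """True when p = lambda * q for some non-zero lambda.

    Two non-zero triples are proportional exactly when all 2x2 minors
    of the matrix with rows p and q vanish.
    """
    if not any(p) or not any(q):
        raise ValueError("(0, 0, 0) does not define a point")
    a, b, c = p
    d, e, f = q
    return a * e == b * d and a * f == c * d and b * f == c * e


def to_affine(p):
    """Map [x, y, z] with z != 0 to (x/z, y/z); ideal points give None."""
    x, y, z = p
    if z == 0:
        return None
    return (Fraction(x) / z, Fraction(y) / z)


def to_projective(point):
    """Map (x, y) to the representative [x, y, 1]."""
    x, y = point
    return (x, y, 1)


print(same_projective_point((1, 2, 3), (2, 4, 6)))   # True
print(to_affine((2, 4, 2)))                          # the affine point (1, 2)
print(to_affine((1, 1, 0)))                          # None: an ideal point
```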



B.1.2.2 Projective Transformations. A geometry consists of a space together
with a group of transformations that act on that space: projective geometry is
no different. Since the points of $\mathbb{CP}_2$ consist of lines through the origin of $\mathbb{C}^3$, a
projective transformation maps straight lines on to straight lines. Consequently
transformations in $\mathbb{CP}_2$ must be linear transformations of $\mathbb{C}^3$. If we define the
map $\Pi : \mathbb{C}^3 - \{0\} \to \mathbb{CP}_2$ by $\Pi(a, b, c) = [a, b, c]$ then we can define a hyperplane
in $\mathbb{CP}_2$ as the image of $U - \{0\}$ under $\Pi$ where $U$ is a two-dimensional complex
subspace of $\mathbb{C}^3$.

Proposition 1.24 Given four distinct points $p_0, p_1, p_2$ and $q$ of $\mathbb{CP}_2$, no three
of which lie on a hyperplane, there is a projective transformation which takes
$p_0$ to $[1, 0, 0]$, $p_1$ to $[0, 1, 0]$, $p_2$ to $[0, 0, 1]$ and $q$ to $[1, 1, 1]$.

Proof Let $u_0, u_1, u_2$ and $v$ be elements of $\mathbb{C}^3 - \{0\}$ whose images under $\Pi$ are
$p_0, p_1, p_2$ and $q$. Since $p_0, p_1, p_2$ do not lie on a hyperplane, $u_0, u_1, u_2$ form a
basis of $\mathbb{C}^3$. There is therefore a unique linear transformation $\alpha$ of $\mathbb{C}^3$ taking
$u_0, u_1, u_2$ to the standard basis $(1,0,0)$, $(0,1,0)$, $(0,0,1)$. Now
$\alpha(v) = (\lambda_0, \lambda_1, \lambda_2)$ where $\lambda_0, \lambda_1, \lambda_2$ are non-zero complex numbers, because
$q$ does not lie on a hyperplane with any two of $p_0, p_1, p_2$.

Hence the composition of $\alpha$ with the $3 \times 3$ matrix

$$\begin{pmatrix} 1/\lambda_0 & 0 & 0 \\ 0 & 1/\lambda_1 & 0 \\ 0 & 0 & 1/\lambda_2 \end{pmatrix}$$

defines a projective transformation taking $p_0$ to $[1/\lambda_0, 0, 0] = [1, 0, 0]$, $p_1$ to
$[0, 1/\lambda_1, 0] = [0, 1, 0]$, $p_2$ to $[0, 0, 1/\lambda_2] = [0, 0, 1]$ and $q$ to $[1, 1, 1]$. So when we
are investigating a certain point on a curve we can assume, after a suitable
projective transformation, that the point is a point of reference, namely
$[1,0,0]$, $[0,1,0]$, or $[0,0,1]$.
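
The proof of Proposition 1.24 is constructive, and the steps can be sketched as follows. This is an illustration only: it assumes rational coordinates so that the arithmetic is exact, and the names `frame_map`, `inverse3` and `apply` are hypothetical helpers, not taken from any of the cited books.

```python
from fractions import Fraction


def det3(m):
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def inverse3(m):
    """Inverse of a 3x3 matrix via the adjugate, in exact arithmetic."""
    det = det3(m)
    if det == 0:
        raise ValueError("p0, p1, p2 lie on a hyperplane")
    (a, b, c), (d, e, f), (g, h, i) = m
    adjugate = [
        [e * i - f * h, c * h - b * i, b * f - c * e],
        [f * g - d * i, a * i - c * g, c * d - a * f],
        [d * h - e * g, b * g - a * h, a * e - b * d],
    ]
    return [[Fraction(entry) / det for entry in row] for row in adjugate]


def apply(m, v):
    """Matrix times column vector."""
    return tuple(sum(m[r][k] * v[k] for k in range(3)) for r in range(3))


def frame_map(u0, u1, u2, v):
    """Matrix sending u0, u1, u2 to multiples of the standard basis
    and v to (1, 1, 1), following the proof above."""
    columns = [[u0[r], u1[r], u2[r]] for r in range(3)]
    alpha = inverse3(columns)          # alpha takes u_i to the i-th basis vector
    lam = apply(alpha, v)              # alpha(v) = (lambda_0, lambda_1, lambda_2)
    if any(l == 0 for l in lam):
        raise ValueError("q lies on a hyperplane with two of p0, p1, p2")
    # Compose alpha with diag(1/lambda_0, 1/lambda_1, 1/lambda_2).
    return [[alpha[r][k] / lam[r] for k in range(3)] for r in range(3)]


u0, u1, u2, v = (1, 2, 0), (0, 1, 1), (1, 0, 1), (1, 1, 1)
M = frame_map(u0, u1, u2, v)
print(apply(M, u0), apply(M, u1), apply(M, u2), apply(M, v))
```

The images of `u0`, `u1`, `u2` are non-zero multiples of the standard basis vectors, so as points of the projective plane they are the reference points, while `v` is sent to `(1, 1, 1)` exactly.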



B.1.3 Affine and Projective Curves

Now we can define a projective curve in $\mathbb{CP}_2$ by a homogeneous polynomial
$P(x, y, z)$ in $x$, $y$ and $z$. Thus a projective curve $C$ is defined by

$$C = \{[x, y, z] \in \mathbb{CP}_2 : P(x, y, z) = 0\}.$$

Algebraic curves in $\mathbb{C}^2$ as defined in Section B.1.1 will be called affine curves
to distinguish them from projective curves. Affine and projective curves are
closely related and it was shown that points in $\mathbb{C}^2$ can be identified with points
in $\mathbb{CP}_2$. This idea can be extended to affine and projective curves.

Let $P(x, y, z) = 0$ be a curve in $\mathbb{CP}_2$ which does not have $z = 0$ as a
component. The curve has an associated non-homogeneous polynomial $P(x, y, 1)$ of
the same degree. Conversely an affine curve $P(x, y) = 0$ in $\mathbb{C}^2$ can be identified
with $z^d P(x/z, y/z) = 0$ in $\mathbb{CP}_2$ where $d$ is the degree of the polynomial.
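
The passage between affine and projective curves amounts to padding every term with a power of $z$. The sketch below is an illustration only: polynomials are stored as dictionaries mapping exponent tuples to coefficients (a representation chosen here for convenience), and the example curve $y^2 - x^3 - x^2$ is the nodal cubic mentioned later in the text.

```python
def homogenize(affine_poly):
    """z^d * P(x/z, y/z): {(i, j): c} becomes {(i, j, d - i - j): c}."""
    terms = {e: c for e, c in affine_poly.items() if c != 0}
    d = max(i + j for (i, j) in terms)
    return {(i, j, d - i - j): c for (i, j), c in terms.items()}


def dehomogenize(proj_poly):
    """P(x, y, 1): drop the exponent of z and collect like terms."""
    out = {}
    for (i, j, _k), c in proj_poly.items():
        out[(i, j)] = out.get((i, j), 0) + c
    return out


def evaluate(proj_poly, x, y, z):
    return sum(c * x ** i * y ** j * z ** k for (i, j, k), c in proj_poly.items())


nodal = {(0, 2): 1, (3, 0): -1, (2, 0): -1}      # y^2 - x^3 - x^2
projective = homogenize(nodal)                   # y^2 z - x^3 - x^2 z
print(projective)
# Homogeneity of degree 3: P(tx, ty, tz) = t^3 P(x, y, z)
print(evaluate(projective, 14, 21, 35) == 7 ** 3 * evaluate(projective, 2, 3, 5))
```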



B.1.4 Singular Points

A singular point is basically a point where the curve does not look "smooth".
This idea can be best illustrated by looking at the sketches of algebraic curves
in $\mathbb{R}^2$ given at the end of this section.

Mathematically, for an affine curve $C$ a singular point or singularity is a point
$(a, b) \in C$ where

$$\frac{\partial P}{\partial x}(a, b) = \frac{\partial P}{\partial y}(a, b) = 0,$$

where $P(x, y) = 0$ defines $C$. Similarly for a projective curve, we have

$$\frac{\partial P}{\partial x}(a, b, c) = \frac{\partial P}{\partial y}(a, b, c) = \frac{\partial P}{\partial z}(a, b, c) = 0,$$

where $[a, b, c] \in C$.

If a curve does not have any singularities it is termed non-singular, e.g.
$x^2 + y^2 - 1 = 0$. The multiplicity of an affine curve defined by $P(x, y)$ at a point
$(a, b)$ is the smallest positive integer $m$ such that

$$\frac{\partial^m P}{\partial x^i \partial y^j}(a, b) \neq 0$$

for some $i \geq 0$, $j \geq 0$ where $i + j = m$.

Similarly for a projective curve the multiplicity of a point $[a, b, c]$ is the smallest
positive integer $m$ such that

$$\frac{\partial^m P}{\partial x^i \partial y^j \partial z^k}(a, b, c) \neq 0 \quad \text{where } i + j + k = m \text{ and } i, j, k \geq 0.$$

The point is non-singular if its multiplicity is one. If this is the case the curve
has a unique tangent line at $(a, b)$. A point of multiplicity two, three, etc. is
called a double point, triple point, etc. A singular point of multiplicity $m$ is
ordinary if the first non-vanishing terms of $P$ expanded about this point have no
repeated factors; it thus has $m$ distinct tangent lines at $(a, b)$.
We will finish this section by looking at some types of singular points in $\mathbb{R}^2$.
Example 1.41 



P{x, y) = x^ — x^ y^ see Figure B.l. 



The origin is a double point with two distinct tangents. This type of double 
point is called a node. 



Example 1.42 

P{x, y) = x^ — y"^ see Figure B.2. 

This double point is not ordinary as it has only one tangent line that runs along
the x-axis. This type of singularity is known as a cusp.

Example 1.43

P{x, y) = -I- — x^y"^ see Figure B.3. 

This point of multiplicity four is not ordinary, as it has only two tangents, at
right angles, running along the x- and y-axes.

Example 1.44 

P{x, y) = y'^ — x^ — y'^Y ~ see Figure B.l. 

Here we have an ordinary point of multiplicity four with four distinct tangents 
which pass through the origin. 
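
The multiplicity and tangent lines at the origin can be read off from the lowest degree terms of the polynomial. The following sketch is an illustration, not part of the project. It uses as assumed examples the nodal cubic $y^2 = x^2(x + 1)$ and the cuspidal cubic $y^2 = x^3$ that appear in Section B.3, which need not be the curves printed in the examples above; polynomials are stored as dictionaries from exponent pairs to coefficients.

```python
def partial(poly, var):
    """Formal partial derivative with respect to x (var=0) or y (var=1)."""
    out = {}
    for exps, c in poly.items():
        e = exps[var]
        if e > 0:
            new = list(exps)
            new[var] -= 1
            out[tuple(new)] = out.get(tuple(new), 0) + c * e
    return out


def evaluate(poly, x, y):
    return sum(c * x ** i * y ** j for (i, j), c in poly.items())


def is_singular(poly, a, b):
    """(a, b) lies on the curve and both first partials vanish there."""
    return (evaluate(poly, a, b) == 0
            and evaluate(partial(poly, 0), a, b) == 0
            and evaluate(partial(poly, 1), a, b) == 0)


def multiplicity_at_origin(poly):
    """Lowest total degree of a non-zero term (0 means the origin is off the curve)."""
    return min(i + j for (i, j), c in poly.items() if c != 0)


def tangent_cone(poly):
    """Lowest degree homogeneous part; its linear factors are the tangent lines."""
    m = multiplicity_at_origin(poly)
    return {e: c for e, c in poly.items() if sum(e) == m and c != 0}


nodal = {(0, 2): 1, (3, 0): -1, (2, 0): -1}    # y^2 - x^3 - x^2
cusp = {(0, 2): 1, (3, 0): -1}                 # y^2 - x^3
circle = {(2, 0): 1, (0, 2): 1, (0, 0): -1}    # x^2 + y^2 - 1

print(multiplicity_at_origin(nodal), tangent_cone(nodal))  # y^2 - x^2 = (y - x)(y + x)
print(multiplicity_at_origin(cusp), tangent_cone(cusp))    # y^2: a repeated tangent
print(is_singular(circle, 1, 0))                           # False
```

For the nodal cubic the lowest terms factor into two distinct lines, so the origin is an ordinary double point; for the cusp the factor $y$ is repeated, so the double point is not ordinary.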
Figure B.1 The curve $P(x, y) = 0$ of Example 1.41

B.2 Intersection of Two Curves and Points of
Inflection



This section investigates whether two polynomials in $x$, $y$ and $z$ have any
common solutions, i.e. how do two projective curves intersect? We shall also look
at points of inflection on a curve.



B.2.1 Bezout's Theorem



The number of points of intersection between two projective curves defined by
the polynomials $P(x, y, z)$ and $Q(x, y, z)$ is given by Bezout's Theorem. If $P$ is
of degree $n$ and $Q$ is of degree $m$, and the curves have no common component,
then $P$ and $Q$ have precisely $nm$ common solutions when these are counted with
multiplicity. To prove this we first need the following algebraic result.

Figure B.2 The curve $P(x, y) = 0$ of Example 1.42



B.2.1.1 Resultant of Two Polynomials. We can rearrange $P$ and $Q$ so that

$$P(x, y, z) = a_0(y, z) + a_1(y, z)x + a_2(y, z)x^2 + \dots + a_n(y, z)x^n$$

and

$$Q(x, y, z) = b_0(y, z) + b_1(y, z)x + b_2(y, z)x^2 + \dots + b_m(y, z)x^m$$

where $a_i, b_j \in \mathbb{C}[y, z]$.

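The resultant $R(y, z)$ of $P$ and $Q$ is given by the determinant of the
$(m + n) \times (m + n)$ matrix, with $m$ rows of the $a_i = a_i(y, z)$ and $n$ rows of the
$b_j = b_j(y, z)$,

$$\begin{pmatrix}
a_0 & a_1 & \cdots & a_n & 0 & \cdots & 0 \\
0 & a_0 & a_1 & \cdots & a_n & \cdots & 0 \\
\vdots & & \ddots & & & \ddots & \vdots \\
0 & \cdots & 0 & a_0 & a_1 & \cdots & a_n \\
b_0 & b_1 & \cdots & b_m & 0 & \cdots & 0 \\
\vdots & & \ddots & & & \ddots & \vdots \\
0 & \cdots & 0 & b_0 & b_1 & \cdots & b_m
\end{pmatrix}.$$

$R(y, z)$ is thus a polynomial in $y$ and $z$ of degree $nm$.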
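
For polynomials in a single variable with constant coefficients the same determinant can be computed directly, which makes the link between a vanishing resultant and a common factor easy to see. The sketch below is an illustration only: it assumes rational coefficients listed from $a_0$ upwards as in the text, and the function names are hypothetical.

```python
from fractions import Fraction


def sylvester(p, q):
    """Sylvester-type matrix of p and q, coefficients listed a_0, ..., a_n."""
    n, m = len(p) - 1, len(q) - 1
    rows = [[0] * i + list(p) + [0] * (m - 1 - i) for i in range(m)]
    rows += [[0] * i + list(q) + [0] * (n - 1 - i) for i in range(n)]
    return rows


def determinant(matrix):
    """Determinant by Gaussian elimination in exact arithmetic."""
    m = [[Fraction(x) for x in row] for row in matrix]
    size = len(m)
    result = Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result          # a row swap changes the sign
        result *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for k in range(col, size):
                m[r][k] -= factor * m[col][k]
    return result


def resultant(p, q):
    return determinant(sylvester(p, q))


# x^2 - 3x + 2 = (x - 1)(x - 2) shares the factor x - 1 with x - 1
print(resultant([2, -3, 1], [-1, 1]))   # 0
# x^2 + 1 and x - 2 have no common root
print(resultant([1, 0, 1], [-2, 1]))    # 5
```

Listing the coefficients in the opposite order can change the sign of the determinant but not whether it vanishes, which is all that the argument below uses.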

Figure B.4 The curve $P(x, y) = 0$ of Example 1.44

If $P(x, y, z)$ and $Q(x, y, z)$ are non-constant homogeneous polynomials we
can arrange it so that

$$P(1, 0, 0) \neq 0 \neq Q(1, 0, 0)$$

which ensures that they have the same degree when regarded as polynomials
in $x$ with coefficients in $y$ and $z$. It follows that $P(x, y, z)$ and $Q(x, y, z)$ have
a non-constant common factor if and only if $R(y, z) = 0$ identically, and it can
always be arranged, by a projective transformation, that the condition
$P(1, 0, 0) \neq 0 \neq Q(1, 0, 0)$ holds. Using this result we can now prove the
weak form of Bezout's theorem.

Theorem 2.12 

If two projective curves of degrees n and m have more than nm common points, 
then they have a common component. 

Proof [Walker 50, 3.1 p59]

Let $C$ and $D$ be two projective curves of degrees $n$ and $m$ defined by the
polynomials $P(x, y, z)$ and $Q(x, y, z)$ that have more than $nm$ common points.
We can select any set of $nm + 1$ of these points and join each pair of selected
points by a line. As there is only a finite set of such lines, there is a point $p$
not on any of the lines or on $C$ or $D$. Under a suitable projective transformation
we can choose this point to be $p = [1, 0, 0]$. Consequently $P$ and $Q$ can be put in
the form

$$P(x, y, z) = a_0(y, z) + a_1(y, z)x + a_2(y, z)x^2 + \dots + a_n(y, z)x^n$$

and

$$Q(x, y, z) = b_0(y, z) + b_1(y, z)x + b_2(y, z)x^2 + \dots + b_m(y, z)x^m$$

where $a_i, b_j \in \mathbb{C}[y, z]$. The resultant $R(y, z)$ of $P$ and $Q$ is either zero or a
homogeneous polynomial of degree $nm$ in $y$ and $z$. We deduce that $R(b, c) = 0$ if
and only if there is an $a$ such that $P(a, b, c) = 0 = Q(a, b, c)$, that is to say
the coordinates $b, c$ of every point common to $C$ and $D$ satisfy $R(y, z) = 0$.
However each of the $nm + 1$ points has a different value for the ratio $b : c$ since
no pair of them is collinear with $[1, 0, 0]$. A non-zero homogeneous polynomial of
degree $nm$ in $y$ and $z$ vanishes for at most $nm$ distinct ratios, hence $R(y, z) = 0$
identically and so $C$ and $D$ have a common component, i.e. two curves with no
common component intersect at $nm$ points at most.

In Section B.1 we defined the multiplicity of a point on a curve. We use a similar
idea to define the intersection multiplicity of two curves and use it to show that
the number of intersections counted properly is precisely $nm$, the strong case of
Bezout's theorem.

Theorem 2.13

If $C$ and $D$ are two projective curves of degrees $n$ and $m$ in $\mathbb{CP}_2$ which have no
common component then they have precisely $nm$ points of intersection counted
properly; i.e.

$$\sum_{p \in C \cap D} I_p(C, D) = nm.$$

The intersection multiplicity can be defined by using the resultant of two
polynomials as defined in Section B.2.1. It follows from our definition of the
resultant that the intersection multiplicity $I_p(C, D)$ for two projective curves
$C$ and $D$ at a point $p$ has the following properties:

* $I_p(C, D) = I_p(D, C)$.

* $I_p(C, D) = \infty$ if the point $p$ lies on a common component of $C$ and $D$,
otherwise $I_p(C, D)$ is a non-negative integer.

* $I_p(C, D) = 0$ if and only if the point $p$ does not lie on $C \cap D$.

* Two distinct lines meet with intersection multiplicity one.

* If $C_1$ and $C_2$ are defined by the homogeneous polynomials $P(x, y, z)$
and $Q(x, y, z)$ and $C$ is defined by $R(x, y, z) = P(x, y, z)Q(x, y, z)$ then
$I_p(C, D) = I_p(C_1, D) + I_p(C_2, D)$.

* If $C$ and $D$ are defined by $P(x, y, z)$ and $Q(x, y, z)$ of degrees $n$ and $m$ and
$E$ is defined by $PR + Q$ where $R(x, y, z)$ is homogeneous of degree $m - n$
then $I_p(C, D) = I_p(C, E)$.

From this we can define the intersection multiplicity $I_p(C, D)$ for a point
$p = [a, b, c] \in C \cap D$ as the largest $k$ where $(bz - cy)^k$ divides the resultant
$R(y, z)$, and it can be shown that $k$ is uniquely determined. Using this definition
we can prove the strong case of Bezout's theorem.

Proof of Theorem 2.13 [Kirwan 92, 3.1 p62] Under a projective
transformation we can choose a point $p = [1, 0, 0]$ such that $p \notin C \cup D$, $p$ does not
lie on a line containing two distinct points of $C \cap D$ and $p$ does not lie on a
tangent line to $C$ or $D$ at any point of $C \cap D$. If $C$ and $D$ are defined by the
homogeneous polynomials $P(x, y, z)$ and $Q(x, y, z)$ then the resultant $R(y, z)$
is a homogeneous polynomial of degree $nm$ in $y$ and $z$.

The resultant $R(y, z)$ can be expressed as a product of linear factors

$$R(y, z) = \prod_{i=1}^{k} (b_i z - c_i y)^{e_i} \quad \text{where } b_i, c_i \in \mathbb{C}$$

and $nm = e_1 + e_2 + \dots + e_k$.

For each $i$ there exists a unique complex number $a_i$ such that
$C \cap D = \{p_i : 1 \leq i \leq k\}$ where $p_i = [a_i, b_i, c_i]$, which leads to
$I_{p_i}(C, D) = e_i$ and hence

$$\sum_{p \in C \cap D} I_p(C, D) = nm$$

proving the result.

□



B.2.2 Points of Inflection on a Curve

A point $p$ on a curve $C$ in $\mathbb{CP}_2$ is a point of inflection of $C$ when the tangent
to $C$ at $p$ is not a component of $C$ and meets $C$ at $p$ with a multiplicity of
at least three. We want a way to compute such points: for this we use the
Hessian curve. If $C$ is defined by the homogeneous polynomial $P(x, y, z)$ then
the Hessian $H(x, y, z)$ of $C$ is given by the determinant of the $3 \times 3$ matrix

$$\begin{pmatrix} P_{xx} & P_{xy} & P_{xz} \\ P_{yx} & P_{yy} & P_{yz} \\ P_{zx} & P_{zy} & P_{zz} \end{pmatrix}.$$

When $H$ vanishes at a non-singular point of $C$ then that point is an inflection
point of $C$; for example if $[a, b, c]$ lies on $C$ and $H(a, b, c) = 0$ then $[a, b, c]$ is a
point of inflection (or flex) of $C$. If $P$ has degree $d$, the second derivatives of
$P(x, y, z)$ are of degree $d - 2$. It follows that $H$ must be a polynomial of degree
$3(d - 2)$ in $x$, $y$ and $z$. We can easily deduce from this and from Bezout's theorem
that a non-singular curve of degree greater than or equal to three has at least one
point of inflection; when $d = 2$, $H$ is a polynomial of degree zero which no
$a, b, c$ could satisfy. When $H = 0$ or a constant then the curve that leads to this is
defined as having no Hessian.



B.3 Conics and Cubics 

To illustrate some of the results that were investigated in Section B.2 it would 
be useful to apply them to some simple curves. It can be easily seen that curves 
of order one, namely lines intersect at only one point and that every point on 
the line is a point of inflection, that is to say its second derivative vanishes. 



B.3.1 Conics 

A conic is a curve of degree 2 in $\mathbb{CP}_2$. Any non-singular projective conic $C$ in
$\mathbb{CP}_2$ is equivalent under a projective transformation to the conic

$$x^2 = yz.$$



B.3.2 Cubics

Cubics are curves of degree 3 in $\mathbb{CP}_2$. Cubics are the first class of curve where
some more interesting properties occur. Firstly we will show that a non-singular
cubic can be written in the standard form $y^2 = g(x)$.

Theorem 3.21

By a proper choice of coordinates any non-singular cubic $C$ can be put in
the form

$$y^2 = g(x) \qquad (1)$$

where $g(x)$ is a cubic polynomial with distinct roots.

Proof [Walker 50, p72]

To prove this we need the result from Section B.2 that every non-singular
curve of degree $\geq 3$ has at least one point of inflection. Under a projective
transformation we can choose $C$ to have a point of inflection at $[0, 0, 1]$, which
reduces the equation of $C$ to

$$x^3 + h(x, y) = 0 \qquad (2)$$

where $h(x, y)$ is of degree 2. Also $h(x, y)$ must include a term $ay^2$ ($a$ not zero);
if not $[0, 0, 1]$ would be a singularity. Solving (2) for $y$ we get

$$y = \alpha x + \beta + \sqrt{g(x)}.$$

If we make the linear transformation $y' = y - \alpha x - \beta$ and $x' = x$ we obtain

$$y' + \alpha x' + \beta = \alpha x' + \beta + \sqrt{g(x')}.$$

Squaring and dropping the dashes we get the required form $y^2 = g(x)$.

□

If we had a singular cubic then $g(x)$ would not have distinct roots. If this were
the case, under a suitable projective transformation $g(x) = x^2(x + 1)$ for a nodal
cubic or $g(x) = x^3$ for a cuspidal cubic. We can now investigate one of the most
famous properties of cubics.


B.3.2.1 Inflections on a Cubic. We have the following theorem.

Theorem 3.23

A non-singular cubic $C$ has 9 distinct points of inflection with the property
that every line joining two of them contains a third.

Proof [Walker 50, 6.6, p72]

Under the linear transformation $x' = ax + b$ and $y' = y$, $y^2 = g(x)$ can be
written

$$F = y^2 - x^3 - ax^2 - bx = 0$$

with $b(a^2 - 4b) \neq 0$ since $x^3 + ax^2 + bx = 0$ has distinct roots. We first look for
points in the finite plane. The Hessian of $F$ is given by

$$H = (y^2 + bx)(3x + a) - (ax + b)^2.$$

Eliminating $y$ between $F$ and $H$ gives

$$p(x) = 3x^4 + 4ax^3 + 6bx^2 - b^2 = 0.$$
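
The elimination step can be checked by substituting $y^2 = x^3 + ax^2 + bx$ into $H$ and comparing with $p(x)$ at sample values. The sketch below is an illustration only (the function names are hypothetical). Agreement on a grid of exact rational values is strong evidence for the identity rather than a proof, although a polynomial of degree four in $x$ is determined by its values at five points for each fixed $a$ and $b$.

```python
from fractions import Fraction


def hessian_on_curve(a, b, x):
    """H with y^2 replaced by g(x) = x^3 + a x^2 + b x."""
    y_squared = x ** 3 + a * x ** 2 + b * x
    return (y_squared + b * x) * (3 * x + a) - (a * x + b) ** 2


def p(a, b, x):
    return 3 * x ** 4 + 4 * a * x ** 3 + 6 * b * x ** 2 - b ** 2


def derivative(coeffs):
    """Derivative of a polynomial given by coefficients from highest degree down."""
    degree = len(coeffs) - 1
    return [c * (degree - k) for k, c in enumerate(coeffs[:-1])]


samples = [Fraction(n, 2) for n in range(-4, 5)]
identity_holds = all(
    hessian_on_curve(a, b, x) == p(a, b, x)
    for a in range(-2, 3)
    for b in range(-2, 3)
    for x in samples
)
print(identity_holds)

# p'(x) = 12 (x^3 + a x^2 + b x), shown here for a = 2, b = 5
a, b = 2, 5
print(derivative([3, 4 * a, 6 * b, 0, -b ** 2]) == [12, 12 * a, 12 * b, 0])
```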

So p{x) has 4 distinct roots and p'{x) = 12(x^ + ax"^ + bx) and the resultant of 
p and p' is 

b^{a^ - 46)2 + 0. 

None of these values of $x$ satisfies $x^3 + ax^2 + bx = 0$, so $y \neq 0$ and for
each $x$ value there are two $y$ values. This makes eight points of inflection in
the finite plane plus a flex at infinity (i.e. at $[0, 0, 1]$ from the assumption
made in Theorem 3.21), which makes a total of nine. If we choose two points of
inflection in the finite plane to be $[1, \alpha, \beta]$ and $[1, \alpha, -\beta]$ say, then the
line that passes through these two points also contains the flex at infinity,
namely $[0, 0, 1]$. We can arrange the nine points of inflection into an array
such that the three points in any row, column or diagonal lie on a line in
$\mathbb{CP}^2$.

Figure B.5 The array in $\mathbb{CP}^2$

We now look at the result that connects conics, cubics, and Bezout’s Theorem.
Let $C$ and $D$ be projective cubics defined by the polynomials $P(x, y, z)$ and
$Q(x, y, z)$ and assume $C$ and $D$ meet at exactly nine points $p_1, \ldots, p_9$. Now
let $E$ be another projective cubic defined by $R(x, y, z)$ where $E$ contains the
points $p_1, \ldots, p_8$. If $C$, $D$ and $E$ are linearly independent then there is a
curve $C(\lambda, \mu, \nu)$ defined by

$\lambda P(x, y, z) + \mu Q(x, y, z) + \nu R(x, y, z) = 0$ for some $\lambda, \mu, \nu$

which passes through the points $p_1, \ldots, p_8$ and any two arbitrary points $q$ and
$r$ say. We will show that this in turn leads to a contradiction and therefore if
$E$ contains $p_1, \ldots, p_8$ then $E$ contains $p_9$. By Bezout’s Theorem, no line can
contain four of these points as such a line would be a common component of $C$
and $D$. Similarly no conic can contain seven of these points.

Now if $p_6$, $p_7$ and $p_8$ lie on a line $L$ then $p_1, \ldots, p_5$ lie on a unique conic
$Q$. This conic is unique because if two conics have five common points they
must have a common component (Bezout’s Theorem), the remaining components can
only have one intersection not on this line, and hence the other four points
must be collinear, which we have excluded. Now let $q \in L$ and $r \notin L \cup Q$.
$C(\lambda, \mu, \nu)$ contains $p_1, \ldots, p_8$ and $\lambda, \mu, \nu$ are chosen so that it contains $q$
and $r$. So $C(\lambda, \mu, \nu)$ must have $L$ as a component, since it meets $L$ in four
points, and the other component is therefore $Q$. Hence $C(\lambda, \mu, \nu) = L \cup Q$, but
this is impossible since $r \notin L \cup Q$.

Now if $p_1, \ldots, p_6$ lie on a conic $Q$ then $p_7, p_8$ lie on a line $L$. If we choose
$C(\lambda, \mu, \nu)$ to vanish at the points $q \in Q$ and $r \notin L \cup Q$ then this again leads
to a contradiction since $C(\lambda, \mu, \nu) = L \cup Q$.

Finally if no three of $p_1, \ldots, p_8$ lie on a line and no six on a conic, we let
$L$ be the line where $p_1, p_2$ lie and $Q$ be the unique conic on which
$p_3, \ldots, p_7$ lie. If we take $q \in L$ and $r \in L$ then this again leads to a
contradiction since $p_8 \notin Q$ (and $p_8 \notin L$) and $C(\lambda, \mu, \nu) = L \cup Q$ as before.

This exhausts all possibilities so we can conclude that the curve
$C(\lambda, \mu, \nu)$ does not pass through $p_1, \ldots, p_8$, $q$ and $r$ for any choice of $q$
and $r$. So $E$ is linearly dependent on $C$ and $D$, and hence $E$ must contain
$p_9$ as well. We can use this result in the next section.

□ 
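
As a quick sanity check on the elimination step in the first part of the proof
above, the following Python sketch substitutes $y^2 = x^3 + ax^2 + bx$ into the
Hessian and compares the result with the quartic $p(x)$ at random rational
points, using exact arithmetic. It also compares $p$ at a root $r$ of
$x^2 + ax + b$ with $(a^2 - 4b)(ar + b)$, which is the calculation behind the
non-vanishing resultant. The function names are illustrative assumptions and
are not taken from [Walker 50]; the check is a sketch, not a replacement for
the algebra.

```python
from fractions import Fraction
import random


def hessian_on_curve(x, a, b):
    """H = (y^2 + bx)(3x + a) - (ax + b)^2 with y^2 replaced by x^3 + ax^2 + bx."""
    y2 = x**3 + a * x**2 + b * x
    return (y2 + b * x) * (3 * x + a) - (a * x + b) ** 2


def p(x, a, b):
    """The quartic obtained by eliminating y between F and H."""
    return 3 * x**4 + 4 * a * x**3 + 6 * b * x**2 - b**2


def identity_holds(trials=200, seed=0):
    """Compare both expressions at random rational points (exact arithmetic)."""
    rng = random.Random(seed)
    for _ in range(trials):
        x, a, b = (Fraction(rng.randint(-50, 50), rng.randint(1, 9)) for _ in range(3))
        if hessian_on_curve(x, a, b) != p(x, a, b):
            return False
    return True


def value_at_quadratic_root(r1, r2):
    """For x^2 + ax + b = (x - r1)(x - r2), return p(r1) and (a^2 - 4b)(a*r1 + b)."""
    a, b = -(r1 + r2), r1 * r2
    return p(r1, a, b), (a * a - 4 * b) * (a * r1 + b)


if __name__ == "__main__":
    print(identity_holds())
    print(value_at_quadratic_root(Fraction(2), Fraction(3)))
```

Exact rational arithmetic is used so that equality can be tested without any
rounding tolerance.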



B.3.3 Additive Group Structure on a Cubic 

Theorem 3.31 

Given any non-singular projective cubic $C$ in $\mathbb{CP}^2$ and a point of inflection
$p_0$ on $C$ there is a unique additive group structure on $C$ such that $p_0$ is the
zero element and three points of $C$ add up to zero if and only if they are the
three points of intersection of $C$ with some line in $\mathbb{CP}^2$ (allowing for
multiplicities).

Proof [Kirwan 92 3.38, p 77] 

Firstly we have to show that the addition and inverse operations are uniquely
determined. Now $-p_0 = p_0$ since $p_0$ is a point of inflection, and if $p \neq p_0$
then $-p$ is the third point of intersection of $C$ with the line in $\mathbb{CP}^2$
through $p$ and $p_0$. So additive inverses are uniquely determined. Now let $p$
and $q$ be any points of $C$; then $p + q = -r$ where $r$ is the third point of
intersection of $C$ with the line in $\mathbb{CP}^2$ through $p$ and $q$ (if $p \neq q$) or the
tangent to $C$ at $p$ (if $p = q$). So the addition operation is uniquely
determined.

Now we have to show that there is an additive group structure on $C$ with
$p_0$ as the zero element. Commutativity, namely $p + q = q + p$, follows from the
definition of the addition operation. Similarly $p + (-p) = p_0$ follows from the
definition of additive inverses. We have to deduce that $p_0$ is the zero element.

For any $p \in C$ such that $p \neq p_0$ we have $p + p_0 = -r$ where $r$ is the third
point of intersection of $C$ with the line in $\mathbb{CP}^2$ through $p$ and $p_0$. This
point $r$ is not $p_0$, so $-r$ is the third point of intersection of $C$ with the
line in $\mathbb{CP}^2$ through $r$ and $p_0$, which is the point $p$. Thus $p + p_0 = p$ if
$p \neq p_0$, and $p_0 + p_0 = p_0$ since $p_0$ is a point of inflection, so $p_0$ is the
zero element.

To prove associativity we let $p, q, r \in C$ and let $L_1$ be the line in $\mathbb{CP}^2$
that meets $C$ at $p, q, -(p + q)$; similarly we let the following be lines in
$\mathbb{CP}^2$ that meet $C$ at

$L_2 : p_0,\ p + q,\ -(p + q)$, $\quad L_3 : r,\ p + q,\ -((p + q) + r)$, $\quad M_1 : q,\ r,\ -(q + r)$,
$M_2 : p_0,\ q + r,\ -(q + r)$ and $M_3 : p,\ q + r,\ -(p + (q + r))$.

Figure B.6

Now let $D$ and $E$ be the reducible cubics $D = L_1 \cup M_2 \cup L_3$ and
$E = M_1 \cup L_2 \cup M_3$. $C$ meets $L_1$ at $p, q, -(p + q)$ and therefore $D$ meets $C$ at

$p_0,\ p,\ q,\ r,\ p + q,\ q + r,\ -(p + q),\ -(q + r),\ -((p + q) + r)$

and by a similar deduction we find that $E$ meets $C$ at

$p_0,\ p,\ q,\ r,\ p + q,\ q + r,\ -(p + q),\ -(q + r),\ -(p + (q + r)).$

We have to show that $(p + q) + r = p + (q + r)$. We can see that $C$ and $D$ meet at
nine points and that $E$ contains eight of them. From the argument previously
in the section, if two projective cubics $C$ and $D$ intersect at nine points and
if a projective cubic $E$ contains eight of them, then it also must contain the
ninth. So $-(p + (q + r))$ and $-((p + q) + r)$ must describe the same point, hence
$(p + q) + r = p + (q + r)$, proving associativity. Note that we used $p_0$ as the zero
element.

In fact we could have used any point of inflection of $C$ as long as we
constructed our group around that point using the proviso that any three
points of $C$ add up to zero if they are the three points of intersection of $C$
with a line in $\mathbb{CP}^2$.

□ 
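
The chord and tangent construction used in the proof can be sketched in a few
lines of Python. The sketch below is an illustration under stated assumptions,
not the construction from [Kirwan 92]: it works over the rationals with the
curve in the short form $y^2 = x^3 + Ax + B$ (a different normal form from the
one used in the text), the particular values of `A` and `B` are arbitrary, and
the flex at infinity, which plays the role of $p_0$, is represented by `None`.

```python
from fractions import Fraction

# Assumed example curve y^2 = x^3 + A x + B (non-singular: 4A^3 + 27B^2 != 0).
A, B = Fraction(-1), Fraction(1)


def point(x, y):
    """Build an affine point with exact rational coordinates."""
    return (Fraction(x), Fraction(y))


def on_curve(P):
    """True if P is the point at infinity (None) or satisfies the curve equation."""
    if P is None:
        return True
    x, y = P
    return y * y == x**3 + A * x + B


def neg(P):
    """-P is the third point on the line through P and the flex at infinity."""
    return None if P is None else (P[0], -P[1])


def add(P, Q):
    """P + Q = -r, where r is the third intersection of the chord (or tangent)."""
    if P is None:
        return Q
    if Q is None:
        return P
    x1, y1 = P
    x2, y2 = Q
    if x1 == x2 and y1 == -y2:
        return None                      # vertical line: P + Q is the zero element
    if P == Q:
        m = (3 * x1 * x1 + A) / (2 * y1)  # slope of the tangent at P
    else:
        m = (y2 - y1) / (x2 - x1)         # slope of the chord through P and Q
    x3 = m * m - x1 - x2
    y3 = m * (x1 - x3) - y1
    return (x3, y3)


if __name__ == "__main__":
    P, Q, R = point(0, 1), point(1, 1), point(3, 5)
    print(add(add(P, Q), R) == add(P, add(Q, R)))   # associativity on one sample
    print(add(add(P, Q), point(-1, 1)))             # three collinear points sum to zero
```

A single sample does not prove associativity, of course; the proof above does
that. The code only makes the definitions of $-p$ and $p + q$ concrete.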



B.4 Complex Analysis 

For the final sections we need some standard results from complex analysis. 
We consider the definition of holomorphic functions and their properties and 
Cauchy’s closed curve theorem. Although some proofs are given not every result 
is proved rigorously and the reader may refer to any introductory book on 
complex analysis for the details, e.g. [Bak and Newman 82]. 

We will use the notation $z = x + iy$ to represent a complex number and $f(z)$
denotes a complex function. $D(z_0 : r)$ is the open disc of radius $r > 0$ centred
at $z_0$ (i.e. $D$ is an open subset of $\mathbb{C}$, or alternatively all neighbourhoods of
$z_0$ contain such a disc).



B.4.1 Holomorphic Functions and Entire Functions 



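A function $f$ is differentiable at $z$ if $\partial f/\partial x$ and $\partial f/\partial y$ exist in a
neighbourhood of $z$, are continuous at $z$ and satisfy the Cauchy-Riemann
equations:

$\frac{\partial f}{\partial y} = i\,\frac{\partial f}{\partial x}.$

Conversely, the Cauchy-Riemann equations hold wherever $f$ is differentiable.
Consequently it can be shown that $\frac{df}{dz} = \frac{\partial f}{\partial x}$. If $D$ is an open subset of $\mathbb{C}$
and a complex function $f$ is differentiable at every $z \in D$ then the function
is holomorphic in $D$. If the first derivative exists throughout $D$ then it can
be shown that all subsequent derivatives exist. The term analytic is often used
to describe a holomorphic function.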



Example 4.11 

The function $f(x + iy) = x^2 - y^2 + 2ixy$ is holomorphic, but the function

$f(x + iy) = x^2 + y^2 - 2ixy$ is not holomorphic.
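
The Cauchy-Riemann condition $\partial f/\partial y = i\,\partial f/\partial x$ is easy to test numerically.
The sketch below uses central finite differences; the two test functions are
assumed here to be $x^2 - y^2 + 2ixy$ (which is $z^2$) and $x^2 + y^2 - 2ixy$, and the
function names and step size are illustrative choices only.

```python
def cr_residual(f, x, y, h=1e-5):
    """Return |f_y - i f_x| at (x, y), estimated by central differences."""
    fx = (f(x + h, y) - f(x - h, y)) / (2 * h)
    fy = (f(x, y + h) - f(x, y - h)) / (2 * h)
    return abs(fy - 1j * fx)


def f_holomorphic(x, y):
    """x^2 - y^2 + 2ixy, i.e. z^2."""
    return x * x - y * y + 2j * x * y


def f_not_holomorphic(x, y):
    """x^2 + y^2 - 2ixy, which fails the Cauchy-Riemann equations when x != 0."""
    return x * x + y * y - 2j * x * y


if __name__ == "__main__":
    print(cr_residual(f_holomorphic, 1.0, 2.0))      # close to zero
    print(cr_residual(f_not_holomorphic, 1.0, 2.0))  # close to 4|x|
```

For the second function a direct calculation gives $f_y - i f_x = -4ix$, so the
residual vanishes only on the line $x = 0$.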

Functions such as polynomials that are everywhere differentiable are called
entire functions. An entire function is therefore infinitely differentiable.

B.4.2 Closed Curve Theorem and Line Integrals



B.4.2.1 Line Integrals. If we let $f(t) = u(t) + iv(t)$ be any continuous
complex-valued function of real variable $t$, then we can write

$\int_a^b f(t)\,dt = \int_a^b u(t)\,dt + i\int_a^b v(t)\,dt$

where $a \le t \le b$.

Now we let $z(t) = x(t) + iy(t)$ where the curve determined by $z(t)$ is piecewise
differentiable and we set $z'(t) = x'(t) + iy'(t)$. If $x$ and $y$ are continuous on
$[a, b]$ and are continuously differentiable on each sub-interval of some
partition of $[a, b]$, we can define a line integral by

$\int_C f(z)\,dz = \int_a^b f(z(t))\,z'(t)\,dt$

where $C$ is a piecewise smooth curve parametrised by $z(t)$.

If two curves $C_1$ and $C_2$ are smoothly equivalent then it turns out that

$\int_{C_1} f(z)\,dz = \int_{C_2} f(z)\,dz.$

If we define $-C$ by $z(b + a - t)$ (i.e. $C$ traversed in the opposite direction) we
find that

$\int_{-C} f(z)\,dz = -\int_C f(z)\,dz.$

If $f$ is the complex derivative of a holomorphic function $F$ say, where $F$ is
holomorphic on $C$, then

$\int_C f(z)\,dz = F(z(b)) - F(z(a)).$

If $f$ is entire, then $f$ is everywhere the derivative of a holomorphic function,
namely there exists an entire $F$ such that $F'(z) = f(z)$ for all $z$ (proof of this
is not given). We will use this and previous results to prove the first
incarnation of the closed curve theorem.
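
The definition $\int_C f(z)\,dz = \int_a^b f(z(t))\,z'(t)\,dt$ translates directly into a
numerical rule. The following sketch approximates the integral by the midpoint
rule and checks it against $F(z(b)) - F(z(a))$, and against the sign change for
$-C$. The curve $z(t) = t + it^2$ on $[0, 1]$, the integrand $f(z) = z^2$ and all
function names are assumptions chosen purely for illustration.

```python
def line_integral(f, z, dz, a, b, n=20000):
    """Midpoint-rule approximation of the integral of f(z(t)) z'(t) over [a, b]."""
    h = (b - a) / n
    total = 0j
    for k in range(n):
        t = a + (k + 0.5) * h
        total += f(z(t)) * dz(t)
    return total * h


def z_curve(t):
    """Assumed example curve z(t) = t + i t^2, 0 <= t <= 1."""
    return complex(t, t * t)


def dz_curve(t):
    """Derivative z'(t) = 1 + 2it of the example curve."""
    return complex(1.0, 2.0 * t)


def z_reversed(t):
    """The curve -C: z(b + a - t) with a = 0, b = 1."""
    return z_curve(1.0 - t)


def dz_reversed(t):
    """Derivative of the reversed parametrisation (chain rule gives a minus sign)."""
    return -dz_curve(1.0 - t)


def f(z):
    return z * z


def F(z):
    """An antiderivative of f."""
    return z**3 / 3


if __name__ == "__main__":
    forward = line_integral(f, z_curve, dz_curve, 0.0, 1.0)
    backward = line_integral(f, z_reversed, dz_reversed, 0.0, 1.0)
    print(forward, F(z_curve(1.0)) - F(z_curve(0.0)))
    print(backward)
```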



B.4.2.2 Closed Curve Theorem. A curve is closed if its initial and end points
coincide. If a curve $C$ is given by $z(t)$, $a \le t \le b$, then $C$ is closed if
$z(a) = z(b)$. We will now show that the integral of an entire function along a
closed curve is zero.

Theorem 4.23

If $f$ is entire and $C$ is a smooth closed curve then

$\int_C f(z)\,dz = 0.$

Proof [Bak and Newman 82, 4.16, p 51]

Since $f$ is entire there exists an entire function $F$ such that $f(z) = F'(z)$.
Hence

$\int_C f(z)\,dz = \int_C F'(z)\,dz = F(z(b)) - F(z(a)).$

Since $C$ is closed, $z(a) = z(b)$ and $F(z(b)) = F(z(a))$, thus

$\int_C f(z)\,dz = 0.$

This can be related to Stokes’ theorem by writing, with $f = u + iv$, $\int_C f(z)\,dz$ as
$\int_C (u\,dx - v\,dy) + i\int_C (v\,dx + u\,dy)$, both parts of which vanish by the
Cauchy-Riemann equations.



B.4.2.3 Cauchy Integral Formula. If $f$ is entire, $a$ is some given complex
number and $C$ is given by

$C : Re^{i\theta},\quad 0 \le \theta \le 2\pi,\quad R > |a|$

then

$f(a) = \frac{1}{2\pi i}\int_C \frac{f(z)}{z - a}\,dz.$

The proof of this theorem is not given here (see [Bak and Newman 82, 5.5, p
55]).
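
The Cauchy integral formula can be checked numerically by sampling the circle
$C : Re^{i\theta}$ at equally spaced angles. The sketch below is only an illustration:
the choice of $f = e^z$, the point $a = 0.3 + 0.4i$, the radius and the number of
sample points are all assumptions.

```python
import cmath
import math


def cauchy_integral(f, a, R=2.0, n=400):
    """Approximate (1 / 2 pi i) * integral over |z| = R of f(z) / (z - a) dz."""
    total = 0j
    for k in range(n):
        z = R * cmath.exp(2j * math.pi * k / n)
        dz = 1j * z * (2 * math.pi / n)   # z'(theta) * d(theta)
        total += f(z) / (z - a) * dz
    return total / (2j * math.pi)


if __name__ == "__main__":
    a = 0.3 + 0.4j
    print(cauchy_integral(cmath.exp, a))
    print(cmath.exp(a))
```

Equally spaced samples on a circle are a natural choice here because the
integrand is periodic in $\theta$, so the simple sum converges very quickly.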



B.4.3 Liouville’s Theorem and Fundamental Theorem of 
Algebra 

B.4.3.1 Liouville’s Theorem. An application of the Cauchy integral formula
is Liouville’s theorem. This says that if $f$ is holomorphic and bounded in $\mathbb{C}$,
then $f$ is a constant. To prove this we simply calculate $f(a) - f(b)$ using the
formula, where $a$ and $b$ are complex numbers and $C$ is a circle centred at zero
with radius $R > \max(|a|, |b|)$. We note that we can take $R$ as large as we want,
so $f(a) - f(b)$ approaches zero as $R$ tends to infinity, i.e. $f(a) = f(b)$, which
implies that $f$ is a constant.

B.4.3.2 Fundamental Theorem of Algebra. Every non-constant polynomial
with complex coefficients has a zero in $\mathbb{C}$.

Proof [Bak and Newman 82, 5.12, p 59]

Let $P(z)$ be a polynomial. If $P(z) \neq 0$ for all $z$ then we can write
$f(z) = 1/P(z)$, which is an entire function. If $P(z)$ is not constant, then
$|P| \to \infty$ as $z \to \infty$, so $f$ is bounded. If $f$ is bounded, then $f$ is constant by
Liouville’s theorem. Hence $P$ must be constant, which contradicts the original
assumption.



B.4.4 Properties of Holomorphic Functions 



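If $f$ is holomorphic in an open disc $D(\alpha : r)$ and $a \in D(\alpha : r)$ it can be shown
that there exist functions $F$ and $G$ which are holomorphic in $D$ such that

$F'(z) = f(z)$ and $G'(z) = \frac{f(z) - f(a)}{z - a}.$

If $C$ is a closed curve in $D$ then

$\int_C f(z)\,dz = 0$

and

$\int_C \frac{f(z) - f(a)}{z - a}\,dz = \int_C G'(z)\,dz = G(z(b)) - G(z(a)) = 0.$

Using this and the result that

$\int_{C_\rho} \frac{dz}{z - a} = 2\pi i$ (proof [Bak and Newman 82, 5.4, p 55])

gives us Cauchy’s integral formula for functions holomorphic in the disc $D$

$f(a) = \frac{1}{2\pi i}\int_{C_\rho} \frac{f(z)}{z - a}\,dz,\quad 0 < \rho < r,$

where $C_\rho : \alpha + \rho e^{i\theta}$, $0 \le \theta \le 2\pi$, and $|a - \alpha| < \rho$.

Cauchy’s theorem leads to an integral representation of derivatives of $f$,
namely

$f^{(n)}(a) = \frac{n!}{2\pi i}\int_{C_\rho} \frac{f(z)}{(z - a)^{n+1}}\,dz.$

This in turn leads to a power series representation for holomorphic functions
in $D(\alpha : r)$

$f(z) = \sum_{n=0}^{\infty} C_n (z - \alpha)^n.$

We can choose $a \in D(\alpha : r)$ and set $\rho > 0$ such that $|a - \alpha| < \rho < r$. If
$|z - \alpha| < |a - \alpha|$ then

$f(z) = \frac{1}{2\pi i}\int_{C_\rho} \frac{f(w)}{w - z}\,dw,$

and using the result that

$\frac{1}{w - \alpha} + \frac{z - \alpha}{(w - \alpha)^2} + \frac{(z - \alpha)^2}{(w - \alpha)^3} + \cdots$

converges to $1/(w - z)$ throughout $C_\rho$ we get the following convergent series
expansion for $f$:

$f(z) = \frac{1}{2\pi i}\int_{C_\rho} f(w)\left[\frac{1}{w - \alpha} + \frac{z - \alpha}{(w - \alpha)^2} + \frac{(z - \alpha)^2}{(w - \alpha)^3} + \cdots\right]dw$

$\quad = C_0(\rho) + C_1(\rho)(z - \alpha) + C_2(\rho)(z - \alpha)^2 + \cdots$

where

$C_n(\rho) = \frac{1}{2\pi i}\int_{C_\rho} \frac{f(w)}{(w - \alpha)^{n+1}}\,dw.$

As $f$ is infinitely differentiable at $\alpha$ and $C_n(\rho) = \frac{f^{(n)}(\alpha)}{n!}$ for each $\rho$,
$0 < \rho < r$, then

$C_n = \frac{f^{(n)}(\alpha)}{n!} = \frac{1}{2\pi i}\int_{C_\rho} \frac{f(w)}{(w - \alpha)^{n+1}}\,dw$

and the expansion holds for all $z \in D(\alpha : r)$.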



B.4.4.1 Mean Value Property. A consequence of Cauchy’s theorem is the
mean value property of holomorphic functions. This is where $f(\alpha)$ is equal to
the mean value of $f$ taken around the boundary of a disc centred at $\alpha$, that is
to say

$f(\alpha) = \frac{1}{2\pi}\int_0^{2\pi} f(\alpha + re^{i\theta})\,d\theta.$

To prove this we simply reformulate Cauchy’s integral formula and set $a = \alpha$.
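
The mean value property lends itself to a direct numerical illustration: the
average of $f$ over equally spaced points of a circle centred at $\alpha$ should
reproduce $f(\alpha)$. In the sketch below the test function $z^3 + e^z$, the centre,
the radius and the function name are assumptions made for the example.

```python
import cmath
import math


def circle_mean(f, alpha, r, n=256):
    """Average of f over n equally spaced points of the circle |z - alpha| = r."""
    total = 0j
    for k in range(n):
        total += f(alpha + r * cmath.exp(2j * math.pi * k / n))
    return total / n


def sample_function(z):
    """An entire test function chosen only for illustration."""
    return z**3 + cmath.exp(z)


if __name__ == "__main__":
    alpha = 0.5 - 0.2j
    print(circle_mean(sample_function, alpha, 1.5))
    print(sample_function(alpha))
```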



B.4.4.2 Morera’s Theorem. The converse of Cauchy’s theorem is Morera’s
theorem which states that if $f$ is continuous in some open subset of $\mathbb{C}$, say
$D$, and if

$\int_\Gamma f = 0$ for every closed curve $\Gamma$ in $D$

then $f$ is holomorphic in $D$. The proof of this theorem is simply to observe
that we can define an indefinite integral $F$ of $f$ and then note that $f$ is the
derivative of the holomorphic function $F$.




B. Project Example 2: Algebraic Curves 






B.4.5 General Cauchy Closed Curve Theorem 



A function $f$ can be holomorphic on a closed curve $C$ and yet $\int_C f \neq 0$, for
example

$\int_C \frac{1}{z}\,dz = 2\pi i$

when $C$ is the unit circle traversed once anti-clockwise.

For the general case of Cauchy’s closed curve theorem we want to find where
the theorem is valid.



B.4.5.1 Simply Connected Domains. The function $f(z) = 1/z$ is holomorphic in
the punctured plane, i.e. there is a ‘hole’ at $z = 0$. We want to define a region
with no ‘holes’. For this we use the definition of a simply connected domain. A
domain is simply connected if every closed curve in the domain can be
contracted to a point without leaving the domain. Clearly a rectangle is simply
connected since every closed curve in the rectangle can be contracted to a
point by means of affine transformations. Simply connected domains are best
illustrated by simple examples (see below).

Example 4.52

The annulus $A = \{z : 1 < |z| < 3\}$ is not simply connected (see Figure B.7).

Example 4.53

The infinite strip $S = \{z : -1 < \mathrm{Im}(z) < 1\}$ is simply connected (see
Figure B.8).

We can now state the general case of Cauchy’s theorem.

B.4.5.2 General Closed Curve Theorem. If $f$ is holomorphic in a simply
connected domain $D$ and $C$ is a smooth closed curve in $D$ then

$\int_C f(z)\,dz = 0.$




B.4.6 Isolated Singularities and Removable Singularities

A complex function $f$ has an isolated singularity at $z_0$ if $f$ is holomorphic in
a deleted neighbourhood of $z_0$ but is not necessarily holomorphic at $z_0$, e.g.
$f(z) = 1/(z - 4)$ has a singularity at $z_0 = 4$.

Figure B.7

Now suppose that $f$ has a singularity at $z_0$. If there exists a function $g$,
holomorphic at $z_0$ too, so that $f(z) = g(z)$ for all $z$ in some deleted
neighbourhood of $z_0$ then we can say that $f$ has a removable singularity at $z_0$,
for example

$f(z) = \begin{cases} \sin z & z \neq 2 \\ 0 & z = 2 \end{cases}$

has a removable singularity at $z_0 = 2$.

If $f$ can be written in the form $\frac{A(z)}{B(z)}$, where $A$ and $B$ are holomorphic and
$A(z_0) \neq 0$ and $B(z_0) = 0$, then we say that $f$ has a pole at $z_0$. If $f$ does not
have a pole or removable singularity at $z_0$ then it is said to have an essential
singularity at $z_0$.

Figure B.8

B.4.7 Laurent Expansions 

In section B.4.4 we saw that functions holomorphic in a disc could be
represented by a power series. A Laurent expansion is a power series which
involves negative powers of $z - z_0$ also and which converges to a holomorphic
function in an annulus $R_1 < |z - z_0| < R_2$, that is

$f(z) = \sum_{k=-\infty}^{\infty} a_k (z - z_0)^k.$

Conversely, if $f$ is holomorphic in the annulus then it has a convergent power
series as above with the coefficients $a_k$ given by

$a_k = \frac{1}{2\pi i}\int_C \frac{f(z)}{(z - z_0)^{k+1}}\,dz$

where $C = C(z_0 : R)$ is the circle of radius $R$ centred at $z_0$ and $R$ is chosen
so that $R_1 < R < R_2$.



B.4.8 Residue Theorem 

If $\gamma$ is a circle surrounding $z_0$ and $f(z) = \sum_{k=-\infty}^{\infty} C_k (z - z_0)^k$ in a deleted
neighbourhood of $z_0$ that contains $\gamma$ then $\int_\gamma f = 2\pi i\,C_{-1}$. Thus the
coefficient $C_{-1}$ has a special importance and it is called the residue of $f$ at
$z_0$. If $f$ has a pole of order one at $z_0$ and can be written $A(z)/B(z)$ then

$C_{-1} = \mathrm{Res}(f : z_0) = \lim_{z \to z_0} (z - z_0) f(z) = \frac{A(z_0)}{B'(z_0)}.$

Example 4.81

$\frac{1}{z^4 - 1}$ has a pole at $z = i$ and hence $\mathrm{Res}\left(\frac{1}{z^4 - 1} : i\right) = \frac{1}{4i^3} = \frac{i}{4}.$

We can now generalise the Cauchy closed curve theorem further to include
functions with singularities. To do this we must first consider the winding
number of a closed curve. The winding number is given by

$n(\gamma, a) = \frac{1}{2\pi i}\int_\gamma \frac{dz}{z - a}$

where $\gamma$ is a closed curve and $a$ does not lie on $\gamma$. It can be shown that for
any closed curve the winding number is an integer (see the examples in Figure
B.9). If we fix $\gamma$ and vary $a$ then the winding number is always a continuous
function of $a$.

Figure B.9

If $\gamma$ is a semi-circle traversed anti-clockwise then

$n(\gamma, a) = \begin{cases} 1 & a \text{ inside semi-circle} \\ 0 & a \text{ outside semi-circle.} \end{cases}$


B.4.8.1 Cauchy’s Residue Theorem. If $f$ is a holomorphic function in a
simply connected domain except at isolated singularities $z_1, z_2, \ldots, z_m$ and if
$\gamma$ is a closed curve not intersecting any singularities then

$\int_\gamma f = 2\pi i \sum_{k=1}^{m} n(\gamma, z_k)\,\mathrm{Res}(f, z_k).$

For proof see [Bak and Newman 82, 10.5, p 110]. We define $f$ as meromorphic in
a domain if $f$ is holomorphic there except at isolated poles. A useful application
of the residue theorem is the evaluation of definite integrals using contour
integral techniques. Suppose we have an integral of the form

$\int_{-\infty}^{\infty} \frac{P(x)}{Q(x)}\,dx$

where $P$ and $Q$ are polynomials. This will make sense if $Q(x) \neq 0$ for real $x$
and if $\deg Q - \deg P \ge 2$; we then take

$\lim_{R \to \infty} \int_{-R}^{R} \frac{P(x)}{Q(x)}\,dx$

and we want to estimate this for large $R$. Let $C_R$ be the closed curve
consisting of the segment from $-R$ to $R$ of the real line followed by the
semi-circle of radius $R$ in the upper half plane.

Figure B.10

By the residue theorem we get

$\int_{C_R} \frac{P(z)}{Q(z)}\,dz = 2\pi i \sum_k \mathrm{Res}\left(\frac{P}{Q} ; z_k\right)$

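where $z_k$ are the poles of $P/Q$ (the zeros of $Q$) contained in $C_R$. It can be
shown that when $R \to \infty$ we get

$\int_{-\infty}^{\infty} \frac{P(x)}{Q(x)}\,dx = 2\pi i \sum_k \mathrm{Res}\left(\frac{P}{Q} ; z_k\right)$

where the sum is now over the poles in the upper half plane.

Example 4.83

$\int_{-\infty}^{\infty} \frac{dx}{x^4 + 1} = 2\pi i \sum_k \mathrm{Res}\left(\frac{1}{z^4 + 1} ; z_k\right)$

and

$\mathrm{Res}\left(\frac{1}{z^4 + 1} ; z_1 = e^{\pi i/4}\right) = \frac{1}{4z_1^3}, \quad \mathrm{Res}\left(\frac{1}{z^4 + 1} ; z_2 = e^{3\pi i/4}\right) = \frac{1}{4z_2^3}$

thus

$\int_{-\infty}^{\infty} \frac{dx}{x^4 + 1} = \frac{\pi\sqrt{2}}{2}.$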

B.4.9 Conformal Mapping 

If $f$ is holomorphic in some domain $D$ which contains the point $z_0$ and if
$f'(z_0) \neq 0$ then $f$ is conformal at $z_0$. This can be interpreted as the angle
between any two curves crossing at $z_0$ being preserved under $f$. For example
$f(z) = e^z$ has a non-zero derivative at all points and is hence conformal. Under
$f$ the lines $x =$ constant and $y =$ constant must remain orthogonal. Under $f$ the
lines $x =$ constant map to circles centred at the origin and the lines
$y =$ constant map to rays away from the origin (see Figure B.11).

Figure B.11

If $f$ is a 1-1 holomorphic function in some domain $D$ then $f^{-1}$ exists in
$f(D)$; also $f$ and $f^{-1}$ are conformal in $D$ and $f(D)$. A 1-1 holomorphic mapping
is called a conformal mapping. This idea of a holomorphic map under some
complex function will be used in the next section.



B.5 Topology and Riemann Surfaces 

In the first chapter we looked at sketches of algebraic curves in $\mathbb{R}^2$. However
we cannot sketch curves in $\mathbb{C}^2$, which has four real dimensions, but curves in
$\mathbb{C}^2$ do have a topology and we can get accurate topological pictures.



B.5.1 Topology of Complex Algebraic Curves

Consider the standard form for the non-singular cubic which we investigated in
B.3.2.1, namely $y^2 = g(x)$ where $g(x)$ is a cubic polynomial. Under a suitable
projective transformation $g(x)$ can be written $g(x) = x(x - 1)(x - \lambda)$. If we
add a point at infinity then a non-singular cubic $C$ is given by

$C = \{y^2 = x(x - 1)(x - \lambda)\} \cup \{\infty\}$

where $C$ is a subset of $\mathbb{CP}^2$ and $\lambda \in \mathbb{C} - \{0, 1\}$. We want to investigate the
topology of this object so consider the mapping

$\pi : C \to \mathbb{CP}^1$ defined by $\pi[x, y, z] = [x, z]$ and $\infty \mapsto [1, 0]$.

This type of mapping is called a branched covering (i.e. a double covering of
$\mathbb{CP}^1$ by $C$). The points where the two values coincide are called branch
points. In affine coordinates this is defined by $(x, y) \mapsto x$, which is a 2-1 map
corresponding to $y = \pm\sqrt{x(x - 1)(x - \lambda)}$. Under stereographic projection $\mathbb{CP}^1$
is homeomorphic to the Riemann sphere. If we consider $y$ as a function of $x$ on
$\mathbb{CP}^1$ then this has two values outside $\{0, 1, \lambda, \infty\}$. Now we can cut $\mathbb{CP}^1$ along
the two paths from $0$ to $1$ and from $\lambda$ to $\infty$. This causes the double cover $C$ to
fall apart into two pieces with $y$ being single-valued on each sheet. We now
open up the cuts and glue the two sheets together along them. So a non-singular
cubic in $\mathbb{CP}^2$ is topologically a torus. More generally it can be shown that any
non-singular algebraic curve in $\mathbb{CP}^2$ is homeomorphic to a sphere with $g$
handles, where $g$ is the genus of the curve and is related to the degree $d$ of the
curve by the degree-genus formula

$g = \frac{1}{2}(d - 1)(d - 2).$

What is the topology of singular projective curves? We will look at what
happens at the end of this section. Thus a non-singular projective curve is
topologically a surface in $\mathbb{CP}^2$. However this type of surface has more structure
and we can do complex analysis on it. This type of surface is called a Riemann
surface.

Figure B.12

Figure B.13

B.5.2 Riemann Surfaces

To define a Riemann surface we first need to show what we mean by a
topological surface. A topological surface is a space $S$ which is locally
homeomorphic to $\mathbb{C}$ or $\mathbb{R}^2$. A homeomorphism $\phi : U \to V$ between an open subset
$U$ of a surface $S$ and an open subset $V$ of $\mathbb{R}^2$ or $\mathbb{C}$ is called a chart (see
Figure B.15). A collection of charts covering $S$ is called an atlas $\Phi$ where
$\Phi = \{\phi_\alpha : U_\alpha \to V_\alpha : \alpha \in A\}$ and hence $S$ is the union of all $U_\alpha$, $\alpha \in A$.

Figure B.15

If we have the intersection of two open subsets in $S$ then

$\phi_\alpha : U_\alpha \to V_\alpha$ and $\phi_\beta : U_\beta \to V_\beta$

are two charts and

$\phi_\alpha(U_\alpha \cap U_\beta) \subseteq V_\alpha$ and $\phi_\beta(U_\alpha \cap U_\beta) \subseteq V_\beta.$

We can now construct the homeomorphism

$\phi_{\alpha\beta} = \phi_\alpha \circ \phi_\beta^{-1} : \phi_\beta(U_\alpha \cap U_\beta) \to \phi_\alpha(U_\alpha \cap U_\beta)$

which maps between two open subsets of $\mathbb{C}$. This is known as the transition
function (see Figure B.16). A surface is a Riemann surface if the transition
functions are holomorphic (see section B.4.1) whenever they are defined. The
atlas is then called a holomorphic atlas.

Figure B.16

Example 5.21

The most obvious example of a Riemann surface is the Riemann sphere
$\mathbb{CP}^1 = \mathbb{C} \cup \{\infty\}$. We let $U = \mathbb{CP}^1 - \{\infty\}$ and $V = \mathbb{CP}^1 - \{0\}$ and we
define

$\phi : U \to \mathbb{C}$ by $\phi[x, y] = x/y$ and $\psi : V \to \mathbb{C}$ by $\psi[x, y] = y/x$

(see Figure B.17). So $\phi$ and $\psi$ form a holomorphic atlas on $\mathbb{CP}^1$ and

$\phi \circ \psi^{-1},\ \psi \circ \phi^{-1} : \mathbb{C} - \{0\} \to \mathbb{C} - \{0\}$

are the transition functions defined by

$z \mapsto \frac{1}{z}$

where $z \in \mathbb{C}$ is identified with $[z, 1] \in \mathbb{CP}^1$.

Figure B.17

We can now return to algebraic curves and show that for a projective curve
$C$ defined by $P(x, y, z)$ in $\mathbb{CP}^2$, $C - \mathrm{Sing}(C)$ has a holomorphic atlas and is
hence a Riemann surface. Suppose that $[a, b, c] \in C - \mathrm{Sing}(C)$ and also

$P(a, b, c) = 0$ and $\frac{\partial P}{\partial y}(a, b, c) \neq 0.$

By Euler’s relation

$a\frac{\partial P}{\partial x}(a, b, c) + b\frac{\partial P}{\partial y}(a, b, c) + c\frac{\partial P}{\partial z}(a, b, c) = 0$

so if $a = c = 0$ then $b = 0$ as well, i.e. $a = b = c = 0$, which is impossible for a
point of $\mathbb{CP}^2$. So either $a \neq 0$ or $c \neq 0$. Assume that $c$ is not equal to zero;
then as $P$ is homogeneous we can write

$\frac{\partial P}{\partial y}(a/c, b/c, 1) = c^{1-d}\,\frac{\partial P}{\partial y}(a, b, c) \neq 0$

where $d$ is the degree of $P(x, y, z)$. The implicit function theorem applied
to $P(x, y, 1)$ reveals that there are open neighbourhoods $V$ and $W$ of $a/c$ and
$b/c$ in $\mathbb{C}$ and a holomorphic function $g : V \to W$ such that if $x \in V$ and $y \in W$
then

$P(x, y, 1) = 0 \iff y = g(x).$

If $V$ and $W$ are small then



$U = \{[x, y, z] \in C : z \neq 0,\ x/z \in V,\ y/z \in W\} = \{[x, y, 1] \in C : x \in V,\ y \in W\}$

is an open neighbourhood of $[a, b, c]$ in $C - \mathrm{Sing}(C)$. The map $\phi : U \to V$
defined by $\phi[x, y, z] = x/z$ has inverse $w \mapsto [w, g(w), 1]$. Similarly if
$[a, b, c] \in C$ and we make the suppositions

$\frac{\partial P}{\partial y}(a, b, c) \neq 0 \neq a$ or $\frac{\partial P}{\partial x}(a, b, c) \neq 0$ or $\frac{\partial P}{\partial z}(a, b, c) \neq 0$

then by similar methods we can find a homeomorphism $\phi : U \to V$ from an open
subset $U$ of $C - \mathrm{Sing}(C)$ onto an open subset $V$ of $\mathbb{C}$ where $\phi[x, y, z]$ has one
of these forms

$z/x,\ y/z,\ z/y,\ x/y,\ y/x.$

The inverses have the following forms:

$w \mapsto [1, g(w), w],\ [g(w), w, 1],\ [g(w), 1, w],\ [w, 1, g(w)]$ or $[1, w, g(w)].$

Now consider the homeomorphism $\phi_\alpha[x, y, z] = x/z$ which has inverse
$w \mapsto [w, g(w), 1]$ and $\phi_\gamma[x, y, z] = y/x$ which has inverse $w \mapsto [1, w, g(w)]$.
Hence,

$\phi_\alpha : U_\alpha \to V_\alpha$ and $\phi_\gamma : U_\gamma \to V_\gamma$

where $U_\alpha$ and $U_\gamma$ are open neighbourhoods of $[a, b, c]$ intersecting in
$C - \mathrm{Sing}(C)$ and $V_\alpha$ and $V_\gamma$ are open subsets of $\mathbb{C}$. We can construct transition
functions

$\phi_{\alpha\gamma} = \phi_\alpha \circ \phi_\gamma^{-1} : V_\gamma \to V_\alpha$ and $\phi_{\gamma\alpha} = \phi_\gamma \circ \phi_\alpha^{-1} : V_\alpha \to V_\gamma.$

See Figure B.18; hence $\phi_{\alpha\gamma}(w) = 1/g(w)$ and $\phi_{\gamma\alpha}(w) = g(w)/w$.

If we define $g : V \to \mathbb{C}$ to be holomorphic then the transition functions are
holomorphic. Similarly if we adopt this method for our other forms of $\phi$ then
we get transition functions to be one of the following:

$w \mapsto w,\ 1/w,\ g(w),\ 1/g(w),\ w/g(w),\ g(w)/w$

where $g$ is holomorphic and does not vanish where the transition function is
defined. Consequently we get a holomorphic atlas on $C - \mathrm{Sing}(C)$ and hence
$C - \mathrm{Sing}(C)$ is a Riemann surface.

Figure B.18

Figure B.19

B.5.3 Degeneration of a Cubic

In this section we want to investigate what happens as the non-singular cubic
degenerates into an irreducible cubic with a node (see Figure B.19). Consider
a curve $C$ described by $C = \{y^2 = x^2(x + 1) - c\}$ where $c$ is real and small.

As $c \to 0$, $C$ degenerates into a cubic with an ordinary double point. To
investigate what happens we again consider the mapping $\pi : C \to \mathbb{CP}^1$,
namely a branched covering. In affine coordinates this corresponds to the map
$y = \pm\sqrt{x^2(x + 1) - c}$ which has three branch points, $a, b, c$ say, plus a point
at infinity. We know that $\mathbb{CP}^1 = \mathbb{C} \cup \{\infty\}$ so we represent the branch points
on $\mathbb{C}$ by Figure B.20. At $c = 0$ under $\pi$ we get the map corresponding to
$y = \pm x\sqrt{x + 1}$ which has two branch points at $-1$ and $\infty$. So as $c$ tends to
zero, four branch points tend towards two branch points, see Figure B.21.

Figure B.20

Figure B.21

It follows that when $c = 0$ we get the Riemann sphere with two points
identified. From Section B.5.1, we know that a non-singular cubic is
topologically a torus and we can represent the branch points $a, b, c$ and $\infty$ on
the edge of a torus. Topologically a singular cubic with a node can be
represented by contracting the meridian $a, b$ to a point. This is topologically
equivalent to the Riemann sphere with two points identified, Figure B.22. As
$c \to 0$ we get a vanishing cycle for a cubic, i.e. a circle around the meridian of
a torus contracting to a point, also represented in the next figure (Figure
B.23) by the real cone. These pictures although topologically correct do not
represent how the curve lies in $\mathbb{C}^2$. In the case of the cubic with a node in $\mathbb{R}^2$
the singular point is two lines passing through the origin. However the complex
algebraic curve at the singular point consists of two planes intersecting
transversely in $\mathbb{C}^2$, which is not possible to draw adequately.

We also notice that the genus of the curve changes from one to zero. In
general the genus $g$ of a curve of degree $d$ with ordinary singular points of
multiplicity $m_i$ is

$g = \frac{1}{2}(d - 2)(d - 1) - \sum_i \frac{1}{2} m_i(m_i - 1).$
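
The genus formula for a plane curve of degree $d$ with ordinary singular points
of multiplicities $m_i$, namely $g = \frac{1}{2}(d - 1)(d - 2) - \sum_i \frac{1}{2} m_i(m_i - 1)$, is simple
enough to evaluate by hand, but a small helper makes the examples in this
section easy to reproduce. The function name and argument layout below are
assumptions for illustration.

```python
def genus(d, multiplicities=()):
    """Genus of a plane curve of degree d with ordinary singular points.

    multiplicities lists the multiplicity m_i of each ordinary singular point;
    with no singular points this is the degree-genus formula (d-1)(d-2)/2.
    """
    twice_g = (d - 1) * (d - 2) - sum(m * (m - 1) for m in multiplicities)
    return twice_g // 2   # both terms are even, so the division is exact


if __name__ == "__main__":
    print(genus(3))          # non-singular cubic: a torus
    print(genus(3, (2,)))    # cubic with one node: a sphere with two points identified
    print(genus(4))          # non-singular quartic
```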




B.5.4 Singularities and Riemann Surfaces 

We have investigated the topology of singular curves but what happens at the
singularities? It can be shown that for any singular projective curve $C$ in
$\mathbb{CP}^2$ we can construct a surjective map $\pi : R \to C$ where $R$ is a compact
Riemann surface. The singular points of $C$ are the image of a finite set of
points in $R$ under $\pi$, and on the complement of this set $\pi$ defines a holomorphic
bijection

$\pi : R - \pi^{-1}(\mathrm{Sing}(C)) \to C - \mathrm{Sing}(C).$

We call this a resolution of singularities.

Figure B.23



B.6 Further Topics 

In this final section we briefly look at some other ideas which lead on directly
from what we have learnt in complex analysis. The discussion is informal and
proofs are hence omitted.

B.6.1 The Weierstrass Function

We let $L = \{nw_1 + mw_2 : n, m \in \mathbb{Z}\}$ where $L$ is a lattice in $\mathbb{C}$ which inherits
the natural group structure of $\mathbb{C}$, see Figure B.24.

Figure B.24

There is a meromorphic function in $\mathbb{C}$ defined by

$\wp(z) = z^{-2} + \sum_{w \in L - \{0\}} \left[(z - w)^{-2} - w^{-2}\right]$

with derivative

$\wp'(z) = \sum_{w \in L} -2(z - w)^{-3}$

where $\wp(z)$ is called the Weierstrass function associated with the lattice $L$ and
$w = nw_1 + mw_2$, i.e. we can choose any $n$ and $m$ to obtain $w$.

It can be shown that $\wp$ satisfies the identity

$\wp'(z)^2 = 4\wp(z)^3 - g_2\wp(z) - g_3$

where

$g_2 = g_2(L) = 60\sum_{w \in L - \{0\}} w^{-4}$, and $g_3 = g_3(L) = 140\sum_{w \in L - \{0\}} w^{-6}.$

Using this identity we can define a non-singular projective curve $C_L$ by

$y^2 z - 4x^3 + g_2 x z^2 + g_3 z^3 = 0$

with $g_2$ and $g_3$ as before.

We know that the lattice $L$ has an additive group structure and is a
subgroup of $\mathbb{C}$ so we can form a quotient group

$\mathbb{C}/L = \{L + a : a \in \mathbb{C}\}.$

We can investigate the topology of $\mathbb{C}/L$ by the map $\pi : \mathbb{C} \to \mathbb{C}/L$. The effect
of this map is to “glue” appropriate edges of the parallelogram together. Hence
$\mathbb{C}/L$ is topologically a torus.

It can also be shown that the homeomorphism $\phi : \mathbb{C}/L \to C_L$ defined by

$\phi(L + z) = \begin{cases} [\wp(z), \wp'(z), 1] & z \notin L \\ [0, 1, 0] & z \in L \end{cases}$

is holomorphic. So we can associate a lattice $L$ in $\mathbb{C}$ to a non-singular cubic
curve $C_L$ in $\mathbb{CP}^2$ and conversely it can be shown that every non-singular cubic
curve comes from some $C_L$.

If we are given $C_L$ how do we get the lattice $L$? To do this we need to
integrate along a smooth path in a Riemann surface.
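
The differential equation $\wp'(z)^2 = 4\wp(z)^3 - g_2\wp(z) - g_3$ can be tested
numerically by truncating the lattice sums. The sketch below is a rough
illustration only: the lattice generators $w_1 = 1$, $w_2 = 0.3 + 1.1i$, the
truncation $|n|, |m| \le N$ and the function names are all assumptions, and
because the sums are truncated the two sides agree only approximately.

```python
def lattice_points(w1, w2, N):
    """Non-zero lattice points n*w1 + m*w2 with |n|, |m| <= N."""
    return [n * w1 + m * w2
            for n in range(-N, N + 1)
            for m in range(-N, N + 1)
            if (n, m) != (0, 0)]


def wp(z, pts):
    """Truncated Weierstrass function."""
    return 1 / z**2 + sum(1 / (z - w) ** 2 - 1 / w**2 for w in pts)


def wp_prime(z, pts):
    """Truncated derivative: -2 times the sum of (z - w)^-3 over the lattice, w = 0 included."""
    return -2 / z**3 - 2 * sum(1 / (z - w) ** 3 for w in pts)


def invariants(pts):
    """Truncated g2 = 60 * sum w^-4 and g3 = 140 * sum w^-6."""
    g2 = 60 * sum(1 / w**4 for w in pts)
    g3 = 140 * sum(1 / w**6 for w in pts)
    return g2, g3


def identity_sides(z, pts):
    """Return (wp'(z)^2, 4 wp(z)^3 - g2 wp(z) - g3) for the truncated sums."""
    g2, g3 = invariants(pts)
    p, dp = wp(z, pts), wp_prime(z, pts)
    return dp * dp, 4 * p**3 - g2 * p - g3


if __name__ == "__main__":
    pts = lattice_points(1.0, 0.3 + 1.1j, 40)
    lhs, rhs = identity_sides(0.3 + 0.2j, pts)
    print(lhs, rhs, abs(lhs - rhs) / abs(lhs))
```

Increasing `N` should shrink the relative difference, at the cost of a sum over
roughly $4N^2$ lattice points.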



B.6.2 Differential Forms on a Riemann Surface

To define integration on a curve we need to show what we mean by a smooth
path. A piecewise smooth path in a Riemann surface $S$ is a continuous map $\gamma$
from a closed interval $[a, b]$ in $\mathbb{R}$ to $S$ such that if $\phi : U \to V$ is a holomorphic
chart on an open subset $U$ of $S$ and $[c, d] \subseteq \gamma^{-1}(U)$ then $\phi \circ \gamma : [c, d] \to V$ is
a piecewise smooth path in the open subset $V$ of $\mathbb{C}$. We have a closed path if
$\gamma(a) = \gamma(b)$.

In order to integrate in $S$ we need to define the concept of meromorphic
and holomorphic differentials. A meromorphic function on a Riemann surface
$S$ is a function $f : S \to \mathbb{CP}^1$ which is holomorphic and is not identically $\infty$ on
any connected component of $S$.

Let $f$ and $g$ be meromorphic functions on a Riemann surface $S$. We define the
symbol $f\,dg$ as a meromorphic differential on $S$. If $\tilde{f}$ and $\tilde{g}$ are also
meromorphic functions on $S$ then we say that $f\,dg = \tilde{f}\,d\tilde{g}$ if and only if for
every holomorphic chart $\phi : U \to V$ on an open subset $U$ of $S$ we get

$(f \circ \phi^{-1})(g \circ \phi^{-1})' = (\tilde{f} \circ \phi^{-1})(\tilde{g} \circ \phi^{-1})'.$

We say that the meromorphic differential $f\,dg$ has a pole at $p$ if
$(f \circ \phi^{-1})(g \circ \phi^{-1})'$ has a pole at $\phi(p)$. If $f\,dg$ has no poles then it is
defined as a holomorphic differential.

The integral of a holomorphic differential on $S$ along a piecewise smooth
path $\gamma : [a, b] \to S$ is given by

$\int_\gamma f\,dg = \int_a^b f \circ \gamma(t)\,(g \circ \gamma)'(t)\,dt$

where $t$ is a parameter; the value depends only on the differential $f\,dg$ and
not on the choice of $f$ and $g$ representing it. If the Riemann surface is $\mathbb{C}$ then

$\int_\gamma f\,dg = \int_\gamma f(z)\,g'(z)\,dz$

from the normal form in complex analysis.



B.6.2.1 Abelian Integrals. An Abelian integral is an integral of the form

$\int_\gamma f\,dg$

where $f$ and $g$ are rational functions on $C - \mathrm{Sing}(C)$ and $\gamma$ is a piecewise
smooth path in $C - \mathrm{Sing}(C)$ not passing through the poles of the meromorphic
differential $f\,dg$. We assume that $C$ is not the line at infinity defined by $z = 0$
and we take the rational function $g$ to be $[x, y, z] \mapsto x/z$; we hence work
in affine coordinates $[x, y, 1]$ and write $dx$ for $dg$. Consequently, $f$ becomes a
rational function $R(x, y)$ and the integral is written

$\int_\gamma f\,dg = \int_\gamma R(x, y)\,dx.$

A special case of an Abelian integral is given by the integral over the curve $C_L$
as defined in Section B.6.1 where $C_L$ is associated with the lattice $L$.

We define $\gamma_1$ and $\gamma_2$ to be closed smooth paths in $\mathbb{C}/L$, see Figure B.25.
Any closed path on $\mathbb{C}/L$ can be obtained by using combinations of $\gamma_1$ and $\gamma_2$
and we can traverse these paths as many times as we want. It can be shown
that there is a meromorphic differential $\eta$ such that

$L = \left\{ n\int_{\gamma_1} \eta + m\int_{\gamma_2} \eta \right\} = \{nw_1 + mw_2\}$ where $n, m \in \mathbb{Z}$.

In Section B.6.1 we defined the homeomorphism $\phi : \mathbb{C}/L \to C_L$ and if we work
in affine coordinates $y$ is a rational function and it can therefore be shown that

$L = \left\{ \int_\gamma y^{-1}\,dx \right\}$

where $\gamma$ is a piecewise smooth closed curve in $C_L$. Hence given any non-singular
projective cubic we get a lattice $L$ in $\mathbb{C}$. As $\mathbb{C}$ has a natural additive group
structure it makes sense that the cubic has a group structure as discussed in
Section B.3.3.

Figure B.25

B.6.3 Abel’s Theorem 

In Section B.6.1 we saw that a complex torus $\mathbb{C}/L$ is biholomorphic to the
cubic $C_L$ in $\mathbb{CP}^2$. Using this result and our investigation into differential forms
leads to Abel’s theorem for cubics which gives an addition formula for elliptic
integrals.

Theorem 6.41

If $t, v, w \in \mathbb{C}$ then $t + v + w \in L$ if and only if there is a line in $\mathbb{CP}^2$ whose
intersection with $C_L$ consists of the points $\phi(L + t)$, $\phi(L + v)$ and $\phi(L + w)$,
where $\phi$ is the homeomorphism defined in Section B.6.1.

Equivalently, if $p, q, r$ lie in $C_L$ then $p, q, r$ are the points of intersection of
$C_L$ with a line if and only if

$L + \int_{[0,1,0]}^{p} y^{-1}dx + \int_{[0,1,0]}^{q} y^{-1}dx + \int_{[0,1,0]}^{r} y^{-1}dx = L + 0.$

Proof [Kirwan 92, 6.23, p 155].
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Epilogue 

This project report reads quite well, if not as well as that in Appendix A. The 
material is more accessible, indeed the chapters on complex variable could form 
part of the second year curriculum. In this instance this particular student could 
not take any modules on complex analysis other than elementary first year ma- 
terial on complex numbers. Some of the later ideas involving topology, Riemann 
surfaces and the Weierstrass function are quite advanced. The assessors for this 
project were considering the possibility of a good second class degree or perhaps 
going even higher. Then came the oral examination where two assessors asked 
the student questions on the written project report. 

This was conducted in a friendly supportive way, but when it came to saying 
anything sensible, at the oral examination the student was completely tongue 
tied. There was no evidence that the student understood what had been written; 
even the more elementary questions were left unanswered. There are several 
reasons why this could happen. The nasty reason is that someone else really 
has written the project! If this can be proved, or the student confesses, a zero 
mark has to be awarded. For this project, this was not the case. Alternatively 
the student could be very nervous and, in the stressful situation of an oral, 
completely dry up. The student could be swept along during the year by an 
over enthusiastic supervisor, so much so that even the student thinks (s)he 
understands everything until the crunch comes and (s)he finds all questions 
at the oral baffling. A third reason could be that the student could be too laid 
back, and there is usually sufficient time between submitting a project report 
and the oral to forget everything, or at least put it beyond recall amongst all 
the other stuff needed for finals that are looming large. Whatever the reason, 
the student did him/herself no justice at all at the oral. The compromise mark 
was given in the mid 50s, corresponding to a lower second class degree. The 
external examiner was probably surprised the mark was this low until he read 
the reports from the assessors. It is very disappointing for both student and 
supervisor (first assessor) when this happens. 




C

Project Example 3: Water Waves on a Sloping Beach



Preamble 

In contrast to the first two appendices, this project example is very much ap- 
plied mathematics. The mathematics is much more elementary, but there is 
the possibility of relating it to the real world. This project thus resembles the 
boomerang project outlined in Section 3.5 or the hurricane project of Section 
3.6. Although the equations governing water waves are not in the course, their 
solution as presented here could be easily understood by second or even first 
year undergraduates. In order to get high marks in such a project therefore, 
it is necessary to either do some advanced mathematics, e.g. stochastic waves, 
non-linear waves, or to relate the simple mathematics to a real world problem 
and to understand the limitations of the model, how it might be extended, pos- 
sible numerical work, etc. It will be seen that unfortunately this project does 
not really do this. 



C.1 Abstract

In this project I have studied the basic Surface Water Wave equations to obtain 
a linear model for the Free Surface and Velocity Potential. From then on right 






running wave equations were used and from these the wave velocity and motion 
of the particle are calculated which could be used to illustrate a breaking wave 
in shallow water. After this wave rays were studied to be able to look at the 
direction of the waves on a sloping beach and show refraction of the wave crests 
as they approach the shore line. 



C.2 Introduction 

Waves on the ocean are generated by wind, which you can see as when there are 
high winds the sea is rough and on calm days the sea can be almost flat. Wind 
dislodges the surface of the water and gravity endeavours to put it back to 
its original state, nature tries to remain in equilibrium. These waves generated 
continue travelling on the open ocean, varying in size depending on conditions, 
they are dampened if opposing the direction of the wind or can grow and 
travel faster when going with the wind. They will obviously eventually meet 
such obstacles as land! 

Something very interesting happens as the waves approach land, i.e. a beach, 
they refract and the wave crests turn parallel to the shoreline. You will also 
notice that the waves break on the beach. 

I have read various books on this subject and found that Ocean Engineering 
Wave Mechanics by M.E. McCormick and An Introduction to Water Waves by 
G.D. Crapper to be the most helpful in my understanding of the subject. 

To begin with we will calculate a wave equation for Linear Surface Waves,
the free surface is modelled by $\eta$ at some position $x$ and time $t$, and
$z = 0$ is assumed to be the still water level as illustrated in Figure C.1.



C.3 Surface Waves 

The Free Surface is defined by

    $z = \eta(x,t)$.   (C.1)

The flow is assumed to be irrotational

    $\nabla \times \mathbf{V} = 0$.   (C.2)

Therefore the fluid velocity $\mathbf{V}$ can be represented by a potential function $\phi$

    $\mathbf{V} = \nabla\phi$.   (C.3)
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Figure C.l Travelling surface wave. (Adapted from Ocean Engineering 
Wave Mechanics by M.E. McCormick, John Wiley and Sons Inc. Reprinted 
with permission.) 



Continuity states that

    $\nabla\cdot\mathbf{V} = 0$.   (C.4)

From Equation (C.3), (C.4) becomes

    $\nabla^2\phi = 0$.   (C.5)

That is Laplace's Equation.



C.3.1 The Current 

Suppose we impose a uni-directional current

    $\mathbf{U} = U\mathbf{i}$.   (C.6)

This current produces additional velocity potential of the form

    $\phi_1 = Ux + \phi$   (C.7)

    $\frac{\partial\phi_1}{\partial x} = U$.   (C.8)

However here it will be assumed that there is no current.
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C.3.2 The Boundary Conditions 



Before we can solve Laplace's Equation there are some boundary conditions we
need to impose.

Suppose that $S$ is a surface in a fluid, moving with the fluid, and fluid
already in $S$ remains there.

Expressing $S$ as the equation

    $S(x, z, t) = 0$.   (C.9)

Also particles in $S$ remain there with varying $x$, $z$ and $t$ which gives us

    $\frac{DS}{Dt} = 0$   (C.10)

where $D$ denotes differentiation following the fluid.

Now with our free surface (C.1), $S$ can be written as

    $S = \eta(x, t) - z = 0$.   (C.11)



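The Chain Rule states that

    $\frac{DS}{Dt} = \frac{\partial S}{\partial t} + \frac{\partial S}{\partial x}\frac{dx}{dt} + \frac{\partial S}{\partial z}\frac{dz}{dt}$.   (C.12)

$\frac{dx}{dt}$ and $\frac{dz}{dt}$ are the components of the velocity
$\mathbf{V} = (u, v, w)$ ($v = 0$).

Therefore Equations (C.11) and (C.12) give

    $\frac{DS}{Dt} = \frac{\partial S}{\partial t} + \mathbf{V}\cdot\nabla S = 0$   (C.13)

    $\frac{\partial\eta}{\partial t} + u\frac{\partial\eta}{\partial x} - w = 0$   (C.14)

with

    $u = \frac{\partial\phi}{\partial x}, \quad w = \frac{\partial\phi}{\partial z}$.   (C.15)

With no current and the second set of Equations (C.15), (C.14) gives

    $\frac{\partial\eta}{\partial t} = \frac{\partial\phi}{\partial z} = w$.   (C.16)

This is called the Kinematic Surface Condition.

On the sea bed

    $z = -h(x)$.   (C.17)

Therefore Equation (C.13) gives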



    $u\frac{\partial h}{\partial x} + w = 0$ at $z = -h$   (C.18)
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and once again assuming no current, on the sea bed, (C.18) gives 

    $\frac{\partial\phi}{\partial z} = 0$ at $z = -h$.   (C.19)



This is the sea bed boundary condition. 

Euler's Equation of motion for inviscid flow is

    $\frac{\partial\mathbf{V}}{\partial t} + (\mathbf{V}\cdot\nabla)\mathbf{V} = -\frac{1}{\rho}\nabla p$.   (C.20)



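From continuity, (C.4), the second term on the L.H.S. of (C.20) is zero.
Hydrostatic balance is

    $p = \rho g(\eta - z)$   (C.21)

and recall that

    $\nabla p = \left(\frac{\partial p}{\partial x}, \frac{\partial p}{\partial y}, \frac{\partial p}{\partial z}\right)$.   (C.22)

Therefore hydrostatic balance implies

    $-\frac{1}{\rho}\frac{\partial p}{\partial x} = -g\frac{\partial\eta}{\partial x}$   (C.23)

and Euler's Equation of motion becomes

    $\frac{\partial\mathbf{V}}{\partial t} = -g\frac{\partial\eta}{\partial x}$.   (C.24)

Representing $\mathbf{V}$ as in (C.3), (C.24) can be written as

    $\frac{\partial}{\partial x}\frac{\partial\phi}{\partial t} = -g\frac{\partial\eta}{\partial x}$   (C.25)

    $\int\frac{\partial^2\phi}{\partial x\,\partial t}\,dx = -g\int\frac{\partial\eta}{\partial x}\,dx$   (C.26)

    $\frac{\partial\phi}{\partial t} = -g\eta + \mathrm{const}$.   (C.27)

Taking the constant of integration to be zero, we obtain

    $\eta = -\frac{1}{g}\frac{\partial\phi}{\partial t}$ at $z = \eta$.   (C.28)

This is called the Dynamic Surface Condition.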

To avoid difficulties when solving Laplace's Equation, with the imposed
boundary conditions, at $z = \eta$, it is assumed that the mean surface level is
at $z = 0$.
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C.3.3 A Separable Solution of Laplace’s Equation 



It is assumed that $\phi$ has a separable solution of the form

    $\phi = X(x)Z(z)T(t)$.   (C.29)

Applying Laplace's Equation to this gives us

    $\frac{1}{X}\frac{d^2X}{dx^2} = -\frac{1}{Z}\frac{d^2Z}{dz^2}$.   (C.30)

Both sides must equal a constant, taken here to be $-k^2$. We can now solve
for $Z$

    $\frac{d^2Z}{dz^2} - k^2 Z = 0$   (C.31)

of which a general solution can be found to be

    $Z = C\sinh(kz) + D\cosh(kz)$   (C.32)

where $C$ and $D$ are constants. Using the boundary condition on the sea bed,
(C.19), we have

    $\frac{dZ}{dz} = 0$ at $z = -h$.   (C.33)

Using condition (C.33) we can find that

    $C = D\,\frac{\sinh(kh)}{\cosh(kh)}$.   (C.34)

Therefore we now have $Z(z)$ as

    $Z(z) = D\cosh(kz + kh)$.   (C.35)

Similarly for $X$

    $\frac{d^2X}{dx^2} + k^2 X = 0$.   (C.36)

We get the general solution

    $X(x) = A\sin(kx) + B\cos(kx)$   (C.37)

where $A$ and $B$ are constants.

Now using the two surface conditions (C.16) and (C.28) we can obtain

    $\frac{\partial\phi}{\partial z} + \frac{1}{g}\frac{\partial^2\phi}{\partial t^2} = 0$ at $z = 0$.   (C.38)

From this we can find an expression for $T(t)$

    $T D k\sinh(kh) + \frac{1}{g}\frac{d^2T}{dt^2} D\cosh(kh) = 0$.   (C.39)
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Making 

    $\omega = \sqrt{gk\tanh(kh)}$   (C.40)

turns (C.39) into

    $\frac{d^2T}{dt^2} + \omega^2 T = 0$   (C.41)

with the general solution being

    $T(t) = E\sin(\omega t) + F\cos(\omega t)$.   (C.42)

$E$ and $F$ are constants. Letting $B$ and $F$ equal zero and putting together our
$X$, (C.37), $Z$, (C.35), and $T$, (C.42), gives us the velocity potential

    $\phi = A\sin(\omega t)\sin(kx)\cosh(kz + kh)$.   (C.43)



The constants A, D and E are written as the one new constant A. 
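
The potential (C.43) can be checked numerically. The following is only a sketch, not part of the original project: the parameter values (A, k, h, g) and the helper names are assumptions chosen for illustration. It uses finite differences to test that (C.43) satisfies Laplace's Equation (C.5), the sea bed condition (C.19) and the combined surface condition (C.38) when the frequency is given by (C.40).

```python
import math

def phi(x, z, t, A=1.0, k=0.5, h=2.0, g=9.81):
    """Velocity potential (C.43); omega comes from the dispersion relation (C.40).
    The default parameter values are illustrative assumptions."""
    omega = math.sqrt(g * k * math.tanh(k * h))
    return A * math.sin(omega * t) * math.sin(k * x) * math.cosh(k * z + k * h)

def laplacian(f, x, z, t, d=1e-3):
    """Central-difference estimate of phi_xx + phi_zz, the left side of (C.5)."""
    fxx = (f(x + d, z, t) - 2.0 * f(x, z, t) + f(x - d, z, t)) / d**2
    fzz = (f(x, z + d, t) - 2.0 * f(x, z, t) + f(x, z - d, t)) / d**2
    return fxx + fzz

def dphi_dz(f, x, z, t, d=1e-5):
    """Central-difference estimate of the vertical velocity phi_z."""
    return (f(x, z + d, t) - f(x, z - d, t)) / (2.0 * d)

def surface_residual(f, x, t, g=9.81, d=1e-3):
    """Residual of (C.38): phi_z + phi_tt / g evaluated at z = 0."""
    phi_z = (f(x, d, t) - f(x, -d, t)) / (2.0 * d)
    phi_tt = (f(x, 0.0, t + d) - 2.0 * f(x, 0.0, t) + f(x, 0.0, t - d)) / d**2
    return phi_z + phi_tt / g

if __name__ == "__main__":
    x, z, t = 1.3, -0.7, 0.4
    print("Laplace residual:", laplacian(phi, x, z, t))
    print("bed velocity    :", dphi_dz(phi, x, -2.0, t))
    print("surface residual:", surface_residual(phi, x, t))
```

All three printed numbers should be close to zero, limited only by the finite-difference step.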

Now that we have a solution for $\phi$ we can use the Dynamic Surface condition,
(C.28), to obtain an equation for the free Surface, $\eta$.

    $\eta = -\frac{A\omega}{g}\cosh(kh)\sin(kx)\cos(\omega t)$.   (C.44)

Letting

    $a = -\frac{A\omega\cosh(kh)}{g}$.   (C.45)

Using this in Equation (C.44) we have

    $\eta = a\sin(kx)\cos(\omega t)$.   (C.46)



Other solutions of this but with different combinations of cosine and sine are
obtained when we allow, say, $B$ and $E$ to equal zero (or $A$ and $E$, or $B$
and $F$)

    $\eta = a\sin(kx)\sin(\omega t)$   (C.47)

    $\eta = a\cos(kx)\cos(\omega t)$   (C.48)

    $\eta = a\cos(kx)\sin(\omega t)$.   (C.49)



Because the solutions we have obtained are for linear waves we can use the
property of Superposition to add together two of our $\eta$'s to make left and
right running waves.

    Left running wave    $\eta = a\sin(kx + \omega t)$   (C.50)

    Right running wave   $\eta = a\cos(kx - \omega t)$.   (C.51)

We then have corresponding Velocity Potentials

    $\phi = A\cosh(kz + kh)\cos(kx + \omega t)$   (C.52)

    $\phi = A\cosh(kz + kh)\sin(kx - \omega t)$.   (C.53)

We shall only be looking at right running waves.
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C.4 [No Title] 

C.4.1 The Velocity of the Waves 



If the coordinate system is allowed to move with the argument of the cosine in
Equation (C.51), then

    $kx - \omega t = \mathrm{const}$.   (C.54)

So we can see that the differentiation of this is equal to zero which leads us to
the wave velocity, $c$

    $c = \frac{dx}{dt} = \frac{\omega}{k}$   (C.55)

and using our value of $\omega$ from (C.40)

    $c = \sqrt{\frac{g\tanh(kh)}{k}}$.   (C.56)
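
As a rough numerical illustration of (C.40) and (C.56), the sketch below evaluates the wave velocity and compares it with the two limiting cases, $\sqrt{gh}$ for shallow water and $\sqrt{g/k}$ for deep water. The function names and the value of g are assumptions made for this example only.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2 (assumed value)

def angular_frequency(k, h, g=G):
    """Dispersion relation (C.40): omega = sqrt(g k tanh(k h))."""
    return math.sqrt(g * k * math.tanh(k * h))

def phase_speed(k, h, g=G):
    """Wave velocity (C.55)/(C.56): c = omega / k."""
    return angular_frequency(k, h, g) / k

if __name__ == "__main__":
    # Shallow water (k h small): c should be close to sqrt(g h).
    print(phase_speed(0.01, 1.0), math.sqrt(G * 1.0))
    # Deep water (k h large): c should be close to sqrt(g / k).
    print(phase_speed(1.0, 100.0), math.sqrt(G / 1.0))
```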



C.4.2 The Group Velocity of the Waves

This is the velocity of a group of waves with the same amplitude, $a$, but slightly
varying wave number, $k$, and frequency. Take two waves with wave numbers $k$
and $\delta k$, and frequencies $\omega$ and $\delta\omega$. Then their combined
velocities would be

    $c_g = \frac{\delta\omega}{\delta k}$   (C.57)

and as $\delta k$ and $\delta\omega$ tend to zero,

    $c_g = \frac{d\omega}{dk}$.   (C.58)

Then differentiating $\omega$ from (C.40) we have

    $c_g = \frac{g\tanh(kh) + gkh\,\mathrm{sech}^2(kh)}{2\omega}$.   (C.59)
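
The derivative in (C.58) can be compared with a numerical derivative of (C.40). This is a minimal sketch under assumed parameter values; the function names are hypothetical. It also shows the familiar limits: the group velocity is about half the wave velocity in deep water and about equal to it in shallow water.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2 (assumed value)

def angular_frequency(k, h, g=G):
    """Dispersion relation (C.40)."""
    return math.sqrt(g * k * math.tanh(k * h))

def group_velocity(k, h, g=G):
    """Analytic d(omega)/dk obtained by differentiating (C.40)."""
    omega = angular_frequency(k, h, g)
    sech_sq = 1.0 / math.cosh(k * h) ** 2
    return (g * math.tanh(k * h) + g * k * h * sech_sq) / (2.0 * omega)

def numerical_group_velocity(k, h, dk=1e-6, g=G):
    """Central-difference estimate of d(omega)/dk, as in (C.57)."""
    return (angular_frequency(k + dk, h, g) - angular_frequency(k - dk, h, g)) / (2.0 * dk)

if __name__ == "__main__":
    k, h = 0.3, 5.0
    print(group_velocity(k, h), numerical_group_velocity(k, h))
    # Ratio of group velocity to wave velocity in deep and shallow water.
    for k, h in [(2.0, 100.0), (0.001, 1.0)]:
        c = angular_frequency(k, h) / k
        print(group_velocity(k, h) / c)
```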



C.4.3 The Motion of the Particles

Remembering that the velocity components of $\mathbf{V}$ are

    $u = \frac{\partial\phi}{\partial x}, \quad w = \frac{\partial\phi}{\partial z}$.   (C.60)
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Now that we have our velocity potential we obtain solutions to these 



    $u = \frac{agk\cosh(kz + kh)\cos(kx - \omega t)}{\omega\cosh(kh)}$   (C.61)

    $w = \frac{agk\sinh(kz + kh)\sin(kx - \omega t)}{\omega\cosh(kh)}$.   (C.62)



The particles can take on new coordinates, $(\alpha, \beta)$, with the origin at
$z = 0$ and some point $x = x_0$. So now $u$ and $w$ vary with time and we have

    $u = \frac{d\alpha}{dt}, \quad w = \frac{d\beta}{dt}$.   (C.63)

We can integrate $u$ and $w$ with respect to $t$ to find expressions for $\alpha$
and $\beta$

    $\alpha = \frac{agk\sin(kx_0 - \omega t)}{-\omega^2}$   (C.64)

    $\beta = \frac{agk\tanh(kh)\cos(kx_0 - \omega t)}{\omega^2}$.   (C.65)

Equations (C.64) and (C.65) can be rearranged, squared and added together
to give

    $\left(\frac{\alpha\,\omega^2}{kag}\right)^2 + \left(\frac{\beta\,\omega^2}{kag\tanh(kh)}\right)^2 = 1$.   (C.66)

Thus from this we can see that the motion of the particles is an Ellipse which
is affected in shape by the depth of the water, $h$. In very deep water we have a
large value for $h$, i.e. as $h \to \infty$, $\tanh(kh) \to 1$. So we can see from
Figure C.2 that the particles travel in a circle in deep water. However in shallow
water with decreasing $h$, i.e. as $h \to 0$, $\tanh(kh) \to 0$. The particles
travel in an Ellipse which will become flatter, as shown in Figure C.2, until they
are eventually travelling in a flat line, as you see with waves on a beach.
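
The flattening of the particle paths can be illustrated numerically. The sketch below is an illustration only, with assumed values for the amplitude, wave number and g. It computes the two semi-axes of the ellipse from the amplitudes in (C.64) and (C.65), using (C.40) for the frequency; the ratio of the vertical to the horizontal semi-axis is then $\tanh(kh)$.

```python
import math

def orbit_semi_axes(a, k, h, g=9.81):
    """Horizontal and vertical semi-axes of the particle path at the surface,
    taken from the amplitudes of alpha (C.64) and beta (C.65)."""
    omega_sq = g * k * math.tanh(k * h)          # dispersion relation (C.40)
    horizontal = a * g * k / omega_sq
    vertical = a * g * k * math.tanh(k * h) / omega_sq
    return horizontal, vertical

if __name__ == "__main__":
    a, k = 0.5, 0.1   # assumed amplitude (m) and wave number (1/m)
    for h in [500.0, 20.0, 5.0, 1.0, 0.5]:
        hor, ver = orbit_semi_axes(a, k, h)
        print(f"h = {h:6.1f}  horizontal = {hor:8.3f}  vertical = {ver:6.3f}  ratio = {ver / hor:.4f}")
```

The ratio tends to 1 (a circle) for large depth and to 0 (a flat line) as the depth shrinks, in line with the discussion above.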




C.4.4 Breaking Waves in Shallow Water 



The velocity of a wave in shallow water can be approximated, for small $h$

    $\tanh(kh) \approx kh$.   (C.67)

Then

    $\omega = k\sqrt{gh}$.   (C.68)

Thus the velocity of the wave, $c$, is approximated using equations (C.56) and
(C.67) to

    $c = \sqrt{gh}$.   (C.69)
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Figure C.2 Motion of the particles 



Now the horizontal velocity component of $\mathbf{V}$, $u$, in shallow water is
approximated also, once again for small $h$

    $\cosh kh \approx 1$   (C.70)

and so using (C.68), (C.70) our $u$ from (C.61) becomes

    $u = \frac{ag\cos(kx - \omega t)}{\sqrt{gh}}$.   (C.71)

We can see that the velocity of the waves in (C.69) decreases as the depth
decreases but the velocity of the particle in (C.71) increases with decreasing
depth. Therefore there is a point at which the velocity of the particles will
begin to be faster than that of the wave.

When the velocities are equal (C.69 = C.71)

    $a\cos(kx - \omega t) = h$.   (C.72)

If this occurs at the top of a wave, the crest, i.e. $\cos(kx - \omega t) = 1$, then
the wave will break at, from (C.72), $h = a$. As $u$ becomes greater than $c$ the
wave will spill over. The linear solution of $\eta$ that we have obtained gives the crest
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Figure C.3 The motion of a particle in shallow water. (Adapted from 
Ocean Engineering Wave Mechanics by M.E. McCormick, John Wiley and 
Sons Inc. Reprinted with permission.) 

a sinusoidal shape, however as we can see from real waves on the beach it is
evident that the crest takes on a slightly more pointed tip, as shown in Figure
C.4. A non-linear solution for $\eta$ would model this much better.
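
The breaking criterion from (C.69), (C.71) and (C.72) can be written as a single ratio. The short sketch below uses a hypothetical function name and assumed sample values. It computes the particle speed at the crest divided by the wave speed in shallow water; within this linear model the ratio is simply $a/h$, so it reaches 1 when the depth has fallen to the wave amplitude.

```python
import math

def crest_speed_ratio(a, h, g=9.81):
    """Ratio u/c at the crest, with c from (C.69) and u from (C.71)
    evaluated where cos(kx - omega t) = 1."""
    c = math.sqrt(g * h)
    u = a * g / math.sqrt(g * h)
    return u / c

if __name__ == "__main__":
    a = 0.5  # assumed wave amplitude in metres
    for h in [4.0, 2.0, 1.0, 0.5, 0.25]:
        ratio = crest_speed_ratio(a, h)
        state = "breaking" if ratio >= 1.0 else "not breaking"
        print(f"h = {h:5.2f}  u/c = {ratio:5.3f}  {state}")
```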







Figure C.4 The crest of a wave. (Adapted from Ocean Engineering Wave 
Mechanics by M.E. McCormick, John Wiley and Sons Inc. Reprinted with 
permission.) 
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C.5 [No Title] 

C.5.1 Plane Waves 

A plane wave is a 2 dimensional wave not now propagating along the $x$ axis but
in any direction, as shown in Figure C.5. We now have the coordinate system
$(x_1, x_2, z)$ and the wave number $k$ is represented as a vector $\mathbf{k}$,
where

    $\mathbf{k} = (k\cos\theta, k\sin\theta) = (k_1, k_2)$.   (C.73)

Figure C.5

From this we need to find a new expression for $kx$ in our Free Surface.

From Figure C.5 we can see that

    $x_1 = x\cos\theta$   (C.74)

    $x_2 = x\sin\theta$   (C.75)

which gives us $x$ in terms of $x_1$ and $x_2$

    $x = x_1\cos\theta + x_2\sin\theta$.   (C.76)

Therefore we have

    $kx = x_1 k_1 + x_2 k_2 = \mathbf{k}\cdot\mathbf{x}$.   (C.77)

The Free Surface of a plane wave is now expressed as

    $\eta = a\cos(\mathbf{k}\cdot\mathbf{x} - \omega t)$   (C.78)

and Velocity Potential

    $\phi = A\cosh(kz + kh)\sin(\mathbf{k}\cdot\mathbf{x} - \omega t)$   (C.79)
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with corresponding vector wave velocity 




(C.80) 



C.5.2 Wave Rays 



The equations obtained so far have been for an infinite train of uniform waves,
but in reality this is not always the case. We are now going to allow small and
slow changes in wave amplitude, $a$, frequency, $\omega/2\pi$, wave number, $k$,
and depth, $h$. The mean surface level previously assumed to be zero is also now
a variable, say, $b$. So our depth is now

    $d = b + h$.   (C.81)

So our Free Surface changes again to become

    $\eta = b + a\cos\Phi$   (C.82)

where the phase function

    $\Phi = \mathbf{k}\cdot\mathbf{x} - \omega t$   (C.83)

and

    $\phi = A\cosh(kz + kd)\sin(\mathbf{k}\cdot\mathbf{x} - \omega t)$   (C.84)

    $\omega = \sqrt{gk\tanh(kd)}$.   (C.85)

We can find expressions for $k_1$, $k_2$ and $\omega$ from the phase function (C.83)

    $k_1 = \frac{\partial\Phi}{\partial x_1}, \quad k_2 = \frac{\partial\Phi}{\partial x_2}, \quad \omega = -\frac{\partial\Phi}{\partial t}$.   (C.86)



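We can show that the phase function $\Phi = \mathbf{k}\cdot\mathbf{x} - \omega t$
holds for the slowly varying changes in the wave train talked about on the
previous page. Using the Taylor Expansion near a fixed point $(x_{10}, x_{20}, t_0)$

    $\Phi(x_1, x_2, t) = \Phi(x_{10}, x_{20}, t_0) + (x_1 - x_{10})\left.\frac{\partial\Phi}{\partial x_1}\right|_0 + (x_2 - x_{20})\left.\frac{\partial\Phi}{\partial x_2}\right|_0 + (t - t_0)\left.\frac{\partial\Phi}{\partial t}\right|_0$   (C.87)

(neglecting higher derivative terms). Using Equations (C.84) and (C.85) it can
be shown that

    $\Phi = k_{10}x_1 + k_{20}x_2 - \omega_0 t$.   (C.88)

Thus $\Phi = \mathbf{k}\cdot\mathbf{x} - \omega t$ holds locally.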
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It is assumed that the second derivatives of <P are continuous, so Equations 
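(C.84) give us

    $\frac{\partial k_1}{\partial x_2} = \frac{\partial k_2}{\partial x_1}$   (C.89)

    $\frac{\partial k_1}{\partial t} + \frac{\partial\omega}{\partial x_1} = 0$   (C.90)

    $\frac{\partial k_2}{\partial t} + \frac{\partial\omega}{\partial x_2} = 0$.   (C.91)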



The depth is now a slowly varying function of $x_1$, $x_2$ and $t$. So $\omega$ can
be represented as a function of $(k_1, k_2, x_1, x_2, t)$

    $\omega(k, d) = \Omega(k_1, k_2, x_1, x_2, t)$.   (C.92)



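From the group velocity $c_g = \frac{d\omega}{dk}$ the vector group velocity could
be assumed to be

    $\mathbf{c}_g = (c_{g1}, c_{g2}) = \left(\frac{\partial\Omega}{\partial k_1}, \frac{\partial\Omega}{\partial k_2}\right)$.   (C.93)

Using Equations (C.90) and (C.92)

    $\frac{\partial\omega}{\partial x_1} = \frac{\partial\Omega}{\partial x_1} + \frac{\partial\Omega}{\partial k_1}\frac{\partial k_1}{\partial x_1} + \frac{\partial\Omega}{\partial k_2}\frac{\partial k_2}{\partial x_1} = -\frac{\partial k_1}{\partial t}$.   (C.94)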



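Substituting Equation (C.89) and the vector group velocity (C.93), (C.94) can
be written as

    $\frac{\partial k_1}{\partial t} + c_{g1}\frac{\partial k_1}{\partial x_1} + c_{g2}\frac{\partial k_1}{\partial x_2} = -\frac{\partial\Omega}{\partial x_1}$.   (C.95)

Now the total derivative

    $\frac{Dk_1}{Dt} = \frac{\partial k_1}{\partial t} + (\mathbf{V}\cdot\nabla)k_1$   (C.96)

    $\frac{Dk_1}{Dt} = \frac{\partial k_1}{\partial t} + u_1\frac{\partial k_1}{\partial x_1} + u_2\frac{\partial k_1}{\partial x_2}$.   (C.97)

You can see from comparing (C.97) and (C.95) that the L.H.S. of (C.95) is also
a total derivative but moving with the vector group velocity, $\mathbf{c}_g$, and
not the particle velocity. We have

    $\frac{dx_1}{dt} = \frac{\partial\Omega}{\partial k_1}$   (C.98)

    $\frac{dx_2}{dt} = \frac{\partial\Omega}{\partial k_2}$.   (C.99)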
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These define lines everywhere that are parallel to the vector group velocity and 
are called Rays 

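And on these rays from Equation (C.95)

    $\frac{dk_1}{dt} = \frac{\partial k_1}{\partial t} + \frac{dx_1}{dt}\frac{\partial k_1}{\partial x_1} + \frac{dx_2}{dt}\frac{\partial k_1}{\partial x_2} = -\frac{\partial\Omega}{\partial x_1}$.   (C.100)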


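And from deducing $\frac{\partial\omega}{\partial x_2}$ and
$\frac{\partial\omega}{\partial t}$ using the same method as in (C.94) we have on
the rays that

    $\frac{dk_1}{dt} = -\frac{\partial\Omega}{\partial x_1}$   (C.101)

    $\frac{dk_2}{dt} = -\frac{\partial\Omega}{\partial x_2}$   (C.102)

    $\frac{d\omega}{dt} = \frac{\partial\Omega}{\partial t}$.   (C.103)

From (C.92) the rays can also be written as

    $\frac{dx_1}{dt} = \frac{d\omega}{dk}\frac{k_1}{k}$   (C.104)

    $\frac{dx_2}{dt} = \frac{d\omega}{dk}\frac{k_2}{k}$   (C.105)

which is the form we will use in the next section.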

These rays represent the direction of travel of the waves and the wave crests 
are perpendicular to this. 



C.5.3 The Waves Approaching a Beach 

Using the Ray equations established previously we can predict what will happen
to the waves as the depth of water decreases, i.e. at a sloping beach. The figure
below shows the sea in the $x_1$ less than zero region and the shore is the $x_2$
axis. To begin with we will look at the waves in the deep water, at
$x_1 = -\infty$, before they are affected by the sloping beach. We shall also
assume again that the variation in the mean surface level, $b$, is negligible and
is zero. We shall assume there is no variation in depth, so $\Omega$ (C.92) is
independent of $x_1$, $x_2$ and $t$, which from (C.103) gives $\omega$ as a constant
on the rays and so a constant everywhere. And in deep water, where
$\tanh(kh) \to 1$, (C.40) gives

    $\omega^2 = gk$.   (C.106)

The previous assumption also tells us that from (C.101) and (C.102), $k_1$ and
$k_2$ are constant on the rays in deep water. Dividing the ray equations (C.104)
and (C.105)



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Figure C.6 Wave rays in deep water 



we have

    $\frac{dx_2}{dx_1} = \frac{k_2}{k_1}$.   (C.107)

And as $k_1$ and $k_2$ are constants we can integrate this

    $x_2 = \int\frac{k_2}{k_1}\,dx_1$   (C.108)

    $x_2 = \frac{k_2}{k_1}x_1 + x_3$   (C.109)

where $x_3$ is the constant of integration. Equation (C.109) shows us that the
waves in deep water travel in straight lines. The wave crests are perpendicular
to the rays so for the crests (C.107) can be written as

    $\frac{dx_2}{dx_1} = -\frac{k_1}{k_2}$   (C.110)

which is integrated to give

    $x_2 = -\frac{k_1}{k_2}x_1 + x_4$   (C.111)

where $x_4$ is the constant of integration.

These equations (C.109) and (C.111) tell us that in deep water the waves
are not parallel to the shore (as the wave number is never zero), as shown in
Figure C.7. But when standing on a beach the waves more often than not run
up parallel to the beach, as in Figure C.9, so now we look at the wave rays
when in shallow water.
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DEEP WATER 

Figure C.7 Direction of waves in deep water 



C.5.4 Wave rays in shallow water 

We have established that $\omega$ is constant on these rays, so

    $\omega^2 = gk\tanh(kd) = \mathrm{const}$.   (C.112)

Thus as we approach the beach and the depth decreases, $k$ must increase to
keep $\omega$ a constant, so if $k$ increases the wave velocity

    $c = \frac{\omega}{k}$ will decrease, i.e. the waves will slow down

and the wave length

    $\lambda = \frac{2\pi}{k}$ will also decrease, shortening the waves.

We can see from this that as the wave nears the shore it will slow down and
turn parallel to the shore, refraction of the waves.
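
The statement that $k$ must increase as the depth decreases can be illustrated by solving (C.112) numerically for $k$ at a fixed $\omega$. The sketch below is not from the original project: the bisection solver, the chosen frequency and the list of depths are all assumptions for illustration.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2 (assumed value)

def wavenumber(omega, d, g=G):
    """Solve omega^2 = g k tanh(k d), equation (C.112), for k by bisection."""
    def f(k):
        return g * k * math.tanh(k * d) - omega**2
    lo, hi = 0.0, omega**2 / g      # deep-water value is a lower bound for k
    while f(hi) < 0.0:              # widen the bracket until the root is enclosed
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    omega = 1.0  # assumed angular frequency in rad/s
    for d in [50.0, 10.0, 5.0, 2.0, 1.0, 0.5]:
        k = wavenumber(omega, d)
        print(f"d = {d:5.1f}  k = {k:.4f}  c = {omega / k:.3f}  wavelength = {2 * math.pi / k:.2f}")
```

With $\omega$ held fixed, the printed wave number grows while the wave velocity and wave length fall as the depth is reduced.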

We can write the depth of the water when it is shallow as a fraction of $x_1$
where the slope has a gradient of $\gamma$

    $d = -\gamma x_1$.   (C.113)

From the shore, i.e. along the $x_2$ axis, the waves look the same, thus

    $\frac{\partial}{\partial x_2} = 0$.   (C.114)
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So from (C.lOl) 



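    $\frac{dk_2}{dt} = 0$   (C.115)

    $k_2 = \mathrm{const} = m$.   (C.116)

Now that

    $k_2 = m \;\Rightarrow\; k_1 = \sqrt{k^2 - m^2}$   (C.117)

and Equation (C.107) is now

    $\frac{dx_2}{dx_1} = \frac{m}{\sqrt{k^2 - m^2}}$.   (C.118)

For small $d$

    $\tanh(kd) \approx kd$.   (C.119)

Using (C.119) and (C.113), (C.112) can be written as

    $\omega^2 = gk^2(-\gamma x_1)$.   (C.120)

So $k$ can be written as a function of $x_1$, and letting $x_1 = -|x_1|$

    $k^2 = \frac{\omega^2}{g\gamma|x_1|}$.   (C.121)

Thus the differential equation (C.118) is

    $\frac{dx_2}{dx_1} = \frac{m}{\sqrt{\frac{\omega^2}{g\gamma|x_1|} - m^2}}$.   (C.122)

And if $|x_1|$ is small enough

    $\frac{dx_2}{d|x_1|} = -m\sqrt{\frac{g\gamma|x_1|}{\omega^2}}$.   (C.123)

We can integrate (C.123) to give

    $x_2 = x_5 - \frac{2m\sqrt{g\gamma}\,|x_1|^{3/2}}{3\omega}$   (C.124)

(where $x_5$ is a constant of integration).
Once again we have for the crests

    $\frac{dx_2}{d|x_1|} = \frac{1}{m}\sqrt{\frac{\omega^2}{g\gamma|x_1|}}$.   (C.125)

Also integrated gives

    $x_2 = \frac{2\omega|x_1|^{1/2}}{m\sqrt{g\gamma}} - x_6$   (C.126)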
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Figure C.8 

(where $x_6$ is the constant of integration).

We now have from (C.124) and (C.126) the direction of travel of the waves
and shape of the crests as $\omega$, $m$, $\gamma$, $g$, $x_5$ and $x_6$ are all
constants. So we can draw the basic shape of these curves. Figure C.8 shows the
basic form the wave rays and crests take approximated by (C.124) and (C.126) with
varying constants of integration. This figure would obviously vary depending on
the other constants. Comparing Figure C.8 to Figure C.9, real waves on a beach,
you can see that the solutions obtained are quite a good approximation to the
actual waves.
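
The curves (C.124) and (C.126) are straightforward to tabulate. The sketch below uses assumed values for $\omega$, $m$, $\gamma$ and the constants of integration, and hypothetical function names. It evaluates a ray and a crest as functions of $|x_1|$ and checks numerically that their slopes multiply to $-1$, i.e. that the approximate crests are perpendicular to the approximate rays.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2 (assumed value)

def ray_x2(s, omega=1.0, m=0.05, gamma=0.1, x5=0.0, g=G):
    """Ray (C.124): x2 as a function of s = |x1| near the shore."""
    return x5 - 2.0 * m * math.sqrt(g * gamma) * s**1.5 / (3.0 * omega)

def crest_x2(s, omega=1.0, m=0.05, gamma=0.1, x6=0.0, g=G):
    """Crest (C.126): x2 as a function of s = |x1| near the shore."""
    return 2.0 * omega * math.sqrt(s) / (m * math.sqrt(g * gamma)) - x6

def slope(f, s, ds=1e-6):
    """Central-difference estimate of d(x2)/d|x1| for a curve f."""
    return (f(s + ds) - f(s - ds)) / (2.0 * ds)

if __name__ == "__main__":
    for s in [0.5, 1.0, 2.0]:
        print(f"|x1| = {s:3.1f}  ray x2 = {ray_x2(s):8.4f}  crest x2 = {crest_x2(s):8.3f}  "
              f"slope product = {slope(ray_x2, s) * slope(crest_x2, s):.6f}")
```

Plotting `ray_x2` and `crest_x2` for several values of the constants of integration would give a family of curves of the kind described for Figure C.8.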



C.6 [No Title] 

C.6.1 Conclusion and Discussion 

The solutions obtained, although only fairly simple approximations, I feel are 
good approximations to the direction of travel of waves on a beach, as can be 
seen by comparing Figures C.8 and C.9, even with the assumptions made. 

I feel that the next thing to go on to here would be to look at a non-linear 
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Figure C.9 Wave refraction at West Hampton Beach, Long Island. (Re- 
produced from Shore Protection Manual, Volume 1, Third edition, 1977, US 
Government Printing Office.) 

Wave Theory, with which we could model the breaking waves on the beach to 
show the peaks the crests form. Stokes Theory for non-linear waves suggests 
letting the wave equations be represented as a series of small perturbation, the 
higher the order the more accurate an approximation is obtained. 

There are many aspects of water waves that can be studied further and in 
more detail and I have only really touched the surface! Such things as the V 
shape pattern of waves generated by a moving object on the surface, whether 
the object is a streamlined speed boat or a lump of wood, or Kelvin’s Theory 
of the wave pattern created by a moving ship. Another interesting wave is the 
solitary wave, a single ‘bump’ that travels uniformly. 
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Epilogue 

This student was one of the many that came to a lecturer and wanted to do 
“something about sailing”! The fundamental problem with this project is its 
lack of scope. Initially, the student and lecturer decided on trying to model the 
movement of floating bodies, hoping to treat the effects of wind on a sail in 
a rudimentary way. However, it soon became clear that this was not going to 
happen, and pragmatism restricted the project student to the study of water 
waves. Even given this, the project still failed to come up to expectations. Only 
elementary linear water wave theory was covered; there were no generalisations 
even though there was some relevant course material on hand. Something such 
as a description of the Gerstner wave, perturbation theory as used by Civil En- 
gineers, even perhaps Cnoidal waves would have at least raised the level a little 
above the mundane, but no. This project ended up both limited in scope and 
short on relating water wave theory to lecture course material on inviscid flow 
or waves. 

The style of the project itself was also rather immature. There were mistakes
in the English (most of which remain), slips in the equations (most of which
remain) and errors in the numbering of equations (all of which have been
eliminated through the use of LaTeX). The layout of the project here is better
than the original, again due to the use of LaTeX.

At the oral examination, things did not improve. The student was very quiet 
and gave a lack-luster performance. This seemed not to be due to nerves, more 
to a lack of preparation. It was almost as if the student realised that the project 
was irretrievably poor and had written it off. The most elementary questions 
were left unanswered. 

There was enough here for a pass. The mathematics though elementary 
was (mostly) correct. The content though meagre was just acceptable. This stu- 
dent was unanimously but with some regret awarded a third class mark for 
this project. For the record, the student went on to gain an upper second on 
the strength of examination performance. Perhaps there really was a deliberate 
channelling of effort towards exams in this case. 
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